; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0040324 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0040324
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein of unknown function (DUF1218)
Genome locationchr13:3856459..3857316
RNA-Seq ExpressionLag0040324
SyntenyLag0040324
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601276.1 hypothetical protein SDJN03_06509, partial [Cucurbita argyrosperma subsp. sororia]1.0e-9491.96Show/hide
Query:  MAVSAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
        MA+SAIVL T+ SLHLIAFVLAVGAERRRSTAKVVPDEYDE TYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
Subjt:  MAVSAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW

Query:  ISFVGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRADTGGWQKHQNEGGGLAMAAPVHDLKQNARLDKA
        ISFVGAEI LLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGA AMTVLSMVGSILYYWAHS+ADTGGWQK QNE GG+ +AAP HDLKQNA++DKA
Subjt:  ISFVGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRADTGGWQKHQNEGGGLAMAAPVHDLKQNARLDKA

XP_022957387.1 uncharacterized protein LOC111458800 isoform X1 [Cucurbita moschata]5.0e-9491.46Show/hide
Query:  MAVSAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
        MA+SAIVL T+ SLHLIAFVLAVGAERRRSTAKVVPDEYDE TYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
Subjt:  MAVSAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW

Query:  ISFVGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRADTGGWQKHQNEGGGLAMAAPVHDLKQNARLDKA
        ISFVGAEI LLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGA AMTVLSMVGSI YYWAHS+ADTGGWQK QNE GG+ +AAP HDLKQNA++DKA
Subjt:  ISFVGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRADTGGWQKHQNEGGGLAMAAPVHDLKQNARLDKA

XP_022986328.1 uncharacterized protein LOC111484079 [Cucurbita maxima]7.7e-9592.42Show/hide
Query:  MAVSAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
        MA+SAIVL T+ SLHLIAFVLAVGAERRRSTAKVVPDEYDE TYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
Subjt:  MAVSAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW

Query:  ISFVGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRADTGGWQKHQNEGGGLAMAAPVHDLKQNARLDK
        ISFVGAEI LLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGA AMTVLSMVGSILYYWAHS+ADTGGWQK QNE GG+ +AAPVHDLKQNA++DK
Subjt:  ISFVGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRADTGGWQKHQNEGGGLAMAAPVHDLKQNARLDK

XP_023514518.1 uncharacterized protein LOC111778772 [Cucurbita pepo subsp. pepo]2.2e-9491.96Show/hide
Query:  MAVSAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
        MA+SAIVL T+ SLHLIAFVLAVGAERRRSTAKVVPDEYDE TYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
Subjt:  MAVSAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW

Query:  ISFVGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRADTGGWQKHQNEGGGLAMAAPVHDLKQNARLDKA
        ISFVGAEI LLAGSARNAYHTKYRAA GGEDLSCATLRKGVFAGA AMTVLSMVGSILYYWAHS+ADTGGWQK QNE GG+ +AAPVHDLKQNA +DKA
Subjt:  ISFVGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRADTGGWQKHQNEGGGLAMAAPVHDLKQNARLDKA

XP_038892755.1 uncharacterized protein LOC120081728 [Benincasa hispida]1.8e-9189.95Show/hide
Query:  MAVSAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
        MA+SAIVLAT+ SLHLIAFVLAVGAERRRSTAKVVPDEYDE+TYC+YGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGK TTVAIFFFVFSW
Subjt:  MAVSAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW

Query:  ISFVGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRADTGGWQKHQNEGGGLAMAAPVHDLKQNARLDKA
        ISFVGAEIALLAGSARNAYHTKYRAAFG E+LSC TLRKGVFAGAAAMTVLSMVGSILYYW HS+ADTGGW+KHQNEG G  MAA VHDLKQN + +KA
Subjt:  ISFVGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRADTGGWQKHQNEGGGLAMAAPVHDLKQNARLDKA

TrEMBL top hitse value%identityAlignment
A0A0A0KWH0 Uncharacterized protein2.7e-8586.29Show/hide
Query:  MAVSAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
        MAVSAIVLAT+ SLHLIAFVLAVGAERRRSTA +VPDEYDE+TYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGK TTVAIFFFVFSW
Subjt:  MAVSAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW

Query:  ISFVGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRADTGGWQKHQ-NEGGGLAMA-APVHDLKQNAR
        ISF+GAEI LLAGSARNAYHTKYRA FG E LSCATLRKGVFAGAAAMTVLSMVGSILYYW HS+ADTGGW+KHQ NEG G+  A   +H  +QNA+
Subjt:  ISFVGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRADTGGWQKHQ-NEGGGLAMA-APVHDLKQNAR

A0A1S3BF21 uncharacterized protein LOC1034893085.8e-8889.12Show/hide
Query:  MAVSAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
        MAVSAIVLAT+ SLHLIAFVLAVGAERRRSTA VVPDEYDE+TYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGK TTVAIFFFVFSW
Subjt:  MAVSAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW

Query:  ISFVGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRADTGGWQKHQNEGGGLAMAAPVH-DLKQ
        ISF+GAEI LLAGSARNAYHTKYRA FG E LSCATLRKGVFAGAAAMTVLSMVGSIL+YW HS+ADTGGW+KHQNEG G+  AA VH D+KQ
Subjt:  ISFVGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRADTGGWQKHQNEGGGLAMAAPVH-DLKQ

A0A5D3CD50 Uncharacterized protein5.8e-8889.12Show/hide
Query:  MAVSAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
        MAVSAIVLAT+ SLHLIAFVLAVGAERRRSTA VVPDEYDE+TYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGK TTVAIFFFVFSW
Subjt:  MAVSAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW

Query:  ISFVGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRADTGGWQKHQNEGGGLAMAAPVH-DLKQ
        ISF+GAEI LLAGSARNAYHTKYRA FG E LSCATLRKGVFAGAAAMTVLSMVGSIL+YW HS+ADTGGW+KHQNEG G+  AA VH D+KQ
Subjt:  ISFVGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRADTGGWQKHQNEGGGLAMAAPVH-DLKQ

A0A6J1GYZ6 uncharacterized protein LOC111458800 isoform X12.4e-9491.46Show/hide
Query:  MAVSAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
        MA+SAIVL T+ SLHLIAFVLAVGAERRRSTAKVVPDEYDE TYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
Subjt:  MAVSAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW

Query:  ISFVGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRADTGGWQKHQNEGGGLAMAAPVHDLKQNARLDKA
        ISFVGAEI LLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGA AMTVLSMVGSI YYWAHS+ADTGGWQK QNE GG+ +AAP HDLKQNA++DKA
Subjt:  ISFVGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRADTGGWQKHQNEGGGLAMAAPVHDLKQNARLDKA

A0A6J1JG73 uncharacterized protein LOC1114840793.7e-9592.42Show/hide
Query:  MAVSAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
        MA+SAIVL T+ SLHLIAFVLAVGAERRRSTAKVVPDEYDE TYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
Subjt:  MAVSAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW

Query:  ISFVGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRADTGGWQKHQNEGGGLAMAAPVHDLKQNARLDK
        ISFVGAEI LLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGA AMTVLSMVGSILYYWAHS+ADTGGWQK QNE GG+ +AAPVHDLKQNA++DK
Subjt:  ISFVGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRADTGGWQKHQNEGGGLAMAAPVHDLKQNARLDK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52910.1 Protein of unknown function (DUF1218)2.4e-3346.51Show/hide
Query:  SAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSWISF
        S +V+  V  L LIA  LA+ AE+RRS  KVVPD   E  +C YG+D +T YG  AF LL ISQ ++   +RCFCCGK L  G +    I  F+  W+ F
Subjt:  SAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSWISF

Query:  VGAEIALLAGSARNAYHTKYRAAFGGED-LSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRADTGGWQKH
        + AE+ LLAGS RNAYHT YR  +  E+  SC  +RKGVFA  A+  + + + S  YY ++SRA  G    H
Subjt:  VGAEIALLAGSARNAYHTKYRAAFGGED-LSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRADTGGWQKH

AT1G61065.1 Protein of unknown function (DUF1218)6.4e-3146.63Show/hide
Query:  SAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSWISF
        S ++L  V    LIAF LAV AE+RR+T ++  +  D  +YCVY  D +T  G+ +F +LL SQ ++   +RC CCG+ L    + + AIF F+ +W+ F
Subjt:  SAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSWISF

Query:  VGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRA
          A++ LLAGS RNAYHTKYR  FG    SC +LRKGVF   AA  VL+ + S LYY   SRA
Subjt:  VGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRA

AT1G68220.1 Protein of unknown function (DUF1218)2.6e-6464.25Show/hide
Query:  MAVSAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTV-AIFFFVFS
        MAVS  +L  V +LHL+AFV A GAERRRSTA  VPD+YDE+T C YGT+ASTVYG+SAFGLLL+SQAVVNGVT+C C GKGL++G + TV AI FFV S
Subjt:  MAVSAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTV-AIFFFVFS

Query:  WISFVGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRADTGGWQKHQNEGGGLAMAAPVHDLKQ
        W+SF+GAE  LL GSARNAYHTK    + G++LSCA L  GVFA  AA T++S++ +ILYY AHS+ADTGGW+KHQN+G  + M  P    KQ
Subjt:  WISFVGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRADTGGWQKHQNEGGGLAMAAPVHDLKQ

AT3G15480.1 Protein of unknown function (DUF1218)4.7e-3448.78Show/hide
Query:  SAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSWISF
        S +V+  V  L LIA  LA+ AE+RRS  KV  D   +  YCVYGTD +T YG  AF LL +SQ ++   +RCFCCGK L  G +   AI  F+  W+ F
Subjt:  SAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSWISF

Query:  VGAEIALLAGSARNAYHTKYRAAFGGED-LSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRA
        + AE+ LLA S RNAYHT+YR  +  ED  SC  +RKGVFA  AA T+ + + S  YY  +SRA
Subjt:  VGAEIALLAGSARNAYHTKYRAAFGGED-LSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRA

AT4G27435.1 Protein of unknown function (DUF1218)1.1e-3349.08Show/hide
Query:  SAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSWISF
        S IV A V   +LIAF LAV AE+RRSTA+VV D   +  YCVY +D +T YG+ AF   + SQ ++  V+RCFCCGK L  G +  +A+  F+ SW+ F
Subjt:  SAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSWISF

Query:  VGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRA
        + AEI LLAGS  NAYHTKYR  F      C TLRKGVFA  A+    + + S  YY+ +  A
Subjt:  VGAEIALLAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGTTTCAGCCATTGTTTTGGCCACCGTTGTTTCTTTGCATCTCATCGCCTTCGTTCTCGCCGTCGGCGCCGAGCGCCGCCGCAGCACCGCTAAGGTTGTGCCGGA
CGAGTACGACGAACAGACGTACTGCGTGTACGGCACCGACGCGTCGACGGTGTACGGACTGTCGGCGTTCGGTTTGCTTCTGATAAGCCAGGCGGTGGTTAACGGCGTTA
CAAGATGTTTCTGCTGCGGAAAGGGTTTAATCAGTGGAAAAGCCACCACCGTCGCCATTTTCTTCTTTGTCTTTTCCTGGATCAGCTTCGTGGGGGCAGAGATTGCGCTG
TTGGCCGGATCGGCGAGGAATGCGTACCACACGAAGTACAGGGCGGCGTTCGGCGGCGAAGATCTGTCGTGTGCGACACTCCGGAAGGGGGTGTTCGCCGGCGCCGCCGC
GATGACGGTGCTGTCGATGGTGGGTTCGATATTGTACTATTGGGCACATTCGAGAGCCGACACAGGAGGATGGCAGAAGCACCAGAATGAAGGCGGCGGCTTGGCAATGG
CGGCGCCTGTTCATGATCTCAAACAAAATGCTCGATTGGACAAGGCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCAGTTTCAGCCATTGTTTTGGCCACCGTTGTTTCTTTGCATCTCATCGCCTTCGTTCTCGCCGTCGGCGCCGAGCGCCGCCGCAGCACCGCTAAGGTTGTGCCGGA
CGAGTACGACGAACAGACGTACTGCGTGTACGGCACCGACGCGTCGACGGTGTACGGACTGTCGGCGTTCGGTTTGCTTCTGATAAGCCAGGCGGTGGTTAACGGCGTTA
CAAGATGTTTCTGCTGCGGAAAGGGTTTAATCAGTGGAAAAGCCACCACCGTCGCCATTTTCTTCTTTGTCTTTTCCTGGATCAGCTTCGTGGGGGCAGAGATTGCGCTG
TTGGCCGGATCGGCGAGGAATGCGTACCACACGAAGTACAGGGCGGCGTTCGGCGGCGAAGATCTGTCGTGTGCGACACTCCGGAAGGGGGTGTTCGCCGGCGCCGCCGC
GATGACGGTGCTGTCGATGGTGGGTTCGATATTGTACTATTGGGCACATTCGAGAGCCGACACAGGAGGATGGCAGAAGCACCAGAATGAAGGCGGCGGCTTGGCAATGG
CGGCGCCTGTTCATGATCTCAAACAAAATGCTCGATTGGACAAGGCGTAG
Protein sequenceShow/hide protein sequence
MAVSAIVLATVVSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSWISFVGAEIAL
LAGSARNAYHTKYRAAFGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWAHSRADTGGWQKHQNEGGGLAMAAPVHDLKQNARLDKA