CuGenDBv2

Tan0018566 (gene) of Snake gourd v1 genome

Gene ID: Tan0018566
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protein of unknown function (DUF1218)
Genome location: LG04:6957987..6959355
RNA-Seq Expression: Tan0018566
Synteny: Tan0018566
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR009606 - Modifying wall lignin-1/2
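
The genome location above follows a `scaffold:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the span can be parsed and its length computed as follows. `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)

def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1

scaffold, start, end = parse_location("LG04:6957987..6959355")
print(scaffold, start, end, span_length(start, end))
```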


Homology
GenBank top hits (e value, % identity, alignment)
KAG6601276.1 hypothetical protein SDJN03_06509, partial [Cucurbita argyrosperma subsp. sororia]; e value: 7.7e-95; % identity: 92.46
Query:  MALSAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
        MALSAIVL TITSLHLIAFVLAVGAERRRSTAKVVPDEYDE TYCVYG+DASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
Subjt:  MALSAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW

Query:  ISFVGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKADTGGWEKHQNEGVGLATAAPVHDLKQNAQLEKA
        ISFVGAEIGLLAGSARNAYHTKYRAA GGEDLSCATLRKGVFAGA AMTVLSMVGSILYYW HSKADTGGW+K QNEG G+  AAP HDLKQNAQ++KA
Subjt:  ISFVGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKADTGGWEKHQNEGVGLATAAPVHDLKQNAQLEKA

XP_022957387.1 uncharacterized protein LOC111458800 isoform X1 [Cucurbita moschata]; e value: 3.8e-94; % identity: 91.96
Query:  MALSAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
        MALSAIVL TITSLHLIAFVLAVGAERRRSTAKVVPDEYDE TYCVYG+DASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
Subjt:  MALSAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW

Query:  ISFVGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKADTGGWEKHQNEGVGLATAAPVHDLKQNAQLEKA
        ISFVGAEIGLLAGSARNAYHTKYRAA GGEDLSCATLRKGVFAGA AMTVLSMVGSI YYW HSKADTGGW+K QNEG G+  AAP HDLKQNAQ++KA
Subjt:  ISFVGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKADTGGWEKHQNEGVGLATAAPVHDLKQNAQLEKA

XP_022986328.1 uncharacterized protein LOC111484079 [Cucurbita maxima]; e value: 7.7e-95; % identity: 92.93
Query:  MALSAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
        MALSAIVL TITSLHLIAFVLAVGAERRRSTAKVVPDEYDE TYCVYG+DASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
Subjt:  MALSAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW

Query:  ISFVGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKADTGGWEKHQNEGVGLATAAPVHDLKQNAQLEK
        ISFVGAEIGLLAGSARNAYHTKYRAA GGEDLSCATLRKGVFAGA AMTVLSMVGSILYYW HSKADTGGW+K QNEG G+  AAPVHDLKQNAQ++K
Subjt:  ISFVGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKADTGGWEKHQNEGVGLATAAPVHDLKQNAQLEK

XP_023514518.1 uncharacterized protein LOC111778772 [Cucurbita pepo subsp. pepo]; e value: 5.9e-95; % identity: 92.46
Query:  MALSAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
        MALSAIVL TITSLHLIAFVLAVGAERRRSTAKVVPDEYDE TYCVYG+DASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
Subjt:  MALSAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW

Query:  ISFVGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKADTGGWEKHQNEGVGLATAAPVHDLKQNAQLEKA
        ISFVGAEIGLLAGSARNAYHTKYRAA+GGEDLSCATLRKGVFAGA AMTVLSMVGSILYYW HSKADTGGW+K QNEG G+  AAPVHDLKQNA ++KA
Subjt:  ISFVGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKADTGGWEKHQNEGVGLATAAPVHDLKQNAQLEKA

XP_038892755.1 uncharacterized protein LOC120081728 [Benincasa hispida]; e value: 2.5e-93; % identity: 91.96
Query:  MALSAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
        MALSAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDE+TYC+YG+DASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGK TTVAIFFFVFSW
Subjt:  MALSAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW

Query:  ISFVGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKADTGGWEKHQNEGVGLATAAPVHDLKQNAQLEKA
        ISFVGAEI LLAGSARNAYHTKYRAA G E+LSC TLRKGVFAGAAAMTVLSMVGSILYYW HSKADTGGWEKHQNEGVG+  AA VHDLKQN Q EKA
Subjt:  ISFVGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKADTGGWEKHQNEGVGLATAAPVHDLKQNAQLEKA

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KWH0 Uncharacterized protein; e value: 2.6e-88; % identity: 88.83
Query:  MALSAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
        MA+SAIVLATITSLHLIAFVLAVGAERRRSTA +VPDEYDE+TYCVYG+DASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGK TTVAIFFFVFSW
Subjt:  MALSAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW

Query:  ISFVGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKADTGGWEKHQ-NEGVGLATA-APVHDLKQNAQ
        ISF+GAEIGLLAGSARNAYHTKYRA  G E LSCATLRKGVFAGAAAMTVLSMVGSILYYW HSKADTGGWEKHQ NEGVG+ TA   +H  +QNAQ
Subjt:  ISFVGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKADTGGWEKHQ-NEGVGLATA-APVHDLKQNAQ

A0A1S3BF21 uncharacterized protein LOC103489308; e value: 1.4e-89; % identity: 89.85
Query:  MALSAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
        MA+SAIVLATI SLHLIAFVLAVGAERRRSTA VVPDEYDE+TYCVYG+DASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGK TTVAIFFFVFSW
Subjt:  MALSAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW

Query:  ISFVGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKADTGGWEKHQNEGVGLATAAPVH-DLK-QNAQ
        ISF+GAEIGLLAGSARNAYHTKYRA  G E LSCATLRKGVFAGAAAMTVLSMVGSIL+YW HSKADTGGWEKHQNEGVG+  AA VH D+K QNAQ
Subjt:  ISFVGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKADTGGWEKHQNEGVGLATAAPVH-DLK-QNAQ

A0A5D3CD50 Uncharacterized protein; e value: 1.4e-89; % identity: 89.85
Query:  MALSAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
        MA+SAIVLATI SLHLIAFVLAVGAERRRSTA VVPDEYDE+TYCVYG+DASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGK TTVAIFFFVFSW
Subjt:  MALSAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW

Query:  ISFVGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKADTGGWEKHQNEGVGLATAAPVH-DLK-QNAQ
        ISF+GAEIGLLAGSARNAYHTKYRA  G E LSCATLRKGVFAGAAAMTVLSMVGSIL+YW HSKADTGGWEKHQNEGVG+  AA VH D+K QNAQ
Subjt:  ISFVGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKADTGGWEKHQNEGVGLATAAPVH-DLK-QNAQ

A0A6J1GYZ6 uncharacterized protein LOC111458800 isoform X1; e value: 1.9e-94; % identity: 91.96
Query:  MALSAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
        MALSAIVL TITSLHLIAFVLAVGAERRRSTAKVVPDEYDE TYCVYG+DASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
Subjt:  MALSAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW

Query:  ISFVGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKADTGGWEKHQNEGVGLATAAPVHDLKQNAQLEKA
        ISFVGAEIGLLAGSARNAYHTKYRAA GGEDLSCATLRKGVFAGA AMTVLSMVGSI YYW HSKADTGGW+K QNEG G+  AAP HDLKQNAQ++KA
Subjt:  ISFVGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKADTGGWEKHQNEGVGLATAAPVHDLKQNAQLEKA

A0A6J1JG73 uncharacterized protein LOC111484079; e value: 3.7e-95; % identity: 92.93
Query:  MALSAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
        MALSAIVL TITSLHLIAFVLAVGAERRRSTAKVVPDEYDE TYCVYG+DASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW
Subjt:  MALSAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSW

Query:  ISFVGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKADTGGWEKHQNEGVGLATAAPVHDLKQNAQLEK
        ISFVGAEIGLLAGSARNAYHTKYRAA GGEDLSCATLRKGVFAGA AMTVLSMVGSILYYW HSKADTGGW+K QNEG G+  AAPVHDLKQNAQ++K
Subjt:  ISFVGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKADTGGWEKHQNEGVGLATAAPVHDLKQNAQLEK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G52910.1 Protein of unknown function (DUF1218); e value: 6.8e-33; % identity: 45.93
Query:  SAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSWISF
        S +V+  +  L LIA  LA+ AE+RRS  KVVPD   E  +C YGSD +T YG  AF LL ISQ ++   +RCFCCGK L  G +    I  F+  W+ F
Subjt:  SAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSWISF

Query:  VGAEIGLLAGSARNAYHTKYRAALGGED-LSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKADTGGWEKH
        + AE+ LLAGS RNAYHT YR     E+  SC  +RKGVFA  A+  + + + S  YY ++S+A  G    H
Subjt:  VGAEIGLLAGSARNAYHTKYRAALGGED-LSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKADTGGWEKH

AT1G61065.1 Protein of unknown function (DUF1218); e value: 2.4e-30; % identity: 45.4
Query:  SAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSWISF
        S ++L  +    LIAF LAV AE+RR+T ++  +  D  +YCVY  D +T  G+ +F +LL SQ ++   +RC CCG+ L    + + AIF F+ +W+ F
Subjt:  SAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSWISF

Query:  VGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKA
          A++ LLAGS RNAYHTKYR   G    SC +LRKGVF   AA  VL+ + S LYY T S+A
Subjt:  VGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKA

AT1G68220.1 Protein of unknown function (DUF1218); e value: 4.4e-64; % identity: 62
Query:  MALSAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTV-AIFFFVFS
        MA+S  +L  +T+LHL+AFV A GAERRRSTA  VPD+YDE+T C YG++ASTVYG+SAFGLLL+SQAVVNGVT+C C GKGL++G + TV AI FFV S
Subjt:  MALSAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTV-AIFFFVFS

Query:  WISFVGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKADTGGWEKHQNEGVGLATAAPVHDLK-QNAQLEK
        W+SF+GAE  LL GSARNAYHTK      G++LSCA L  GVFA  AA T++S++ +ILYY  HSKADTGGWEKHQN+G+ +    P    K QN +  K
Subjt:  WISFVGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKADTGGWEKHQNEGVGLATAAPVHDLK-QNAQLEK

AT3G15480.1 Protein of unknown function (DUF1218); e value: 1.5e-32; % identity: 46.95
Query:  SAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSWISF
        S +V+  +  L LIA  LA+ AE+RRS  KV  D   +  YCVYG+D +T YG  AF LL +SQ ++   +RCFCCGK L  G +   AI  F+  W+ F
Subjt:  SAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSWISF

Query:  VGAEIGLLAGSARNAYHTKYRAALGGED-LSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKA
        + AE+ LLA S RNAYHT+YR     ED  SC  +RKGVFA  AA T+ + + S  YY  +S+A
Subjt:  VGAEIGLLAGSARNAYHTKYRAALGGED-LSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKA

AT4G27435.1 Protein of unknown function (DUF1218); e value: 3.1e-33; % identity: 48.47
Query:  SAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSWISF
        S IV A +   +LIAF LAV AE+RRSTA+VV D   +  YCVY SD +T YG+ AF   + SQ ++  V+RCFCCGK L  G +  +A+  F+ SW+ F
Subjt:  SAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSWISF

Query:  VGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKA
        + AEI LLAGS  NAYHTKYR         C TLRKGVFA  A+    + + S  YY+ +  A
Subjt:  VGAEIGLLAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKA
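
In BLAST-style pairwise alignments such as those above, the middle line shows a letter where the two residues are identical, `+` for a conservative substitution, and a space otherwise. A rough sketch of how percent identity could be estimated from such a midline follows, assuming identity is the number of identical positions divided by the alignment length. The values reported on the page may be computed over a different span, so this need not reproduce them. `midline_identity` is a hypothetical helper name.

```python
def midline_identity(midline, alignment_length=None):
    """Percent identity estimated from a BLAST-style midline string.

    Letters count as identical positions; '+' and spaces do not.
    alignment_length can be passed explicitly because trailing spaces
    in the midline are often lost when the text is copied.
    """
    if alignment_length is None:
        alignment_length = len(midline)
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / alignment_length

# Toy midline: three identical positions out of five.
print(midline_identity("AB+ D"))
```

For a multi-block alignment, the midline segments would be concatenated before calling the function, with the full alignment length supplied if trailing spaces were stripped.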


Sequences
CDS sequence
ATGGCACTCTCCGCCATTGTTTTGGCCACCATCACTTCTCTGCATCTCATCGCCTTCGTCCTCGCCGTCGGCGCCGAGCGCCGCCGCAGCACCGCTAAGGTTGTGCCGGA
TGAGTACGACGAACAGACATACTGCGTGTACGGCTCCGACGCGTCGACGGTGTACGGACTGTCGGCGTTCGGTTTGCTTCTGATAAGCCAGGCGGTGGTTAACGGCGTCA
CGAGATGTTTCTGCTGTGGAAAGGGTTTAATCAGTGGAAAAGCCACCACTGTCGCCATTTTCTTCTTTGTCTTTTCCTGGATCAGCTTTGTGGGAGCGGAGATCGGGCTG
TTGGCCGGATCGGCGAGGAATGCGTATCACACGAAGTACAGGGCGGCGTTGGGCGGCGAAGATTTGTCGTGTGCGACACTCCGGAAGGGGGTGTTCGCCGGCGCCGCCGC
GATGACAGTGCTGTCGATGGTGGGTTCGATTTTGTACTATTGGACACATTCGAAAGCTGACACAGGAGGATGGGAGAAACACCAGAATGAAGGCGTCGGCTTGGCAACGG
CGGCGCCTGTTCATGATCTGAAACAAAATGCTCAATTGGAGAAGGCGTAG
mRNA sequence
AATCATCGCCGAATTTCAACAAAATCAGATTAAATTTCAAATTTTGAAGCCTCCCCATTGTTCTAAGCTTCAATTTCCTTTAAGATTCAAAAAAAAAAAAAAAAAAAGAA
GCAAGCAAGCTAATAATGGCACTCTCCGCCATTGTTTTGGCCACCATCACTTCTCTGCATCTCATCGCCTTCGTCCTCGCCGTCGGCGCCGAGCGCCGCCGCAGCACCGC
TAAGGTTGTGCCGGATGAGTACGACGAACAGACATACTGCGTGTACGGCTCCGACGCGTCGACGGTGTACGGACTGTCGGCGTTCGGTTTGCTTCTGATAAGCCAGGCGG
TGGTTAACGGCGTCACGAGATGTTTCTGCTGTGGAAAGGGTTTAATCAGTGGAAAAGCCACCACTGTCGCCATTTTCTTCTTTGTCTTTTCCTGGATCAGCTTTGTGGGA
GCGGAGATCGGGCTGTTGGCCGGATCGGCGAGGAATGCGTATCACACGAAGTACAGGGCGGCGTTGGGCGGCGAAGATTTGTCGTGTGCGACACTCCGGAAGGGGGTGTT
CGCCGGCGCCGCCGCGATGACAGTGCTGTCGATGGTGGGTTCGATTTTGTACTATTGGACACATTCGAAAGCTGACACAGGAGGATGGGAGAAACACCAGAATGAAGGCG
TCGGCTTGGCAACGGCGGCGCCTGTTCATGATCTGAAACAAAATGCTCAATTGGAGAAGGCGTAGAGCTAGGTTTTTGGTTTTGTGAGATGATTATTATTATATTATATT
GGAGAAGGACAGAGTTTAGTGTTTGTTTCTAGGGAAAAAAAGAAAAAAGGGAAATGTGTTAATCACTTGTATTAATCATGAATGAACCTTTTTTCTTTTTTAATTTTTTG
TCTCGTATTTTAAGGAGTTTTCTTTTTCTTTTACGACGCGTCGTTTCGAAGTTTTGAATGTTTGATTCTGCTCCGATGATGGGATTTTGCTCCATTGGTTGCTAGGGGTG
AATCTTGATTTTTTTGAGAAATTGTTGGGAGGCATTTGACGCCTAAGGTTGAGTTAATGAGACAAAATTGAGCTTAGTTTGTGAAGTTTGTATTTAGAGTGCAAATTTGG
ATTGAGTTGAATTAGAAATTTTGTGTTTGAAATGTTGAATTGAGTTGAATTGTCCTGTA
Protein sequence
MALSAIVLATITSLHLIAFVLAVGAERRRSTAKVVPDEYDEQTYCVYGSDASTVYGLSAFGLLLISQAVVNGVTRCFCCGKGLISGKATTVAIFFFVFSWISFVGAEIGL
LAGSARNAYHTKYRAALGGEDLSCATLRKGVFAGAAAMTVLSMVGSILYYWTHSKADTGGWEKHQNEGVGLATAAPVHDLKQNAQLEKA
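
As a consistency check, the CDS can be translated and compared with the protein sequence. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1). `translate` is a hypothetical helper name, and only the first seven and last four codons of the CDS above are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCACTCTCCGCCATTGTT"))  # start of the CDS -> MALSAIV
print(translate("GAGAAGGCGTAG"))           # end of the CDS -> EKA
```

Both fragments agree with the start (`MALSAIV`) and end (`EKA`) of the protein sequence listed above.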