; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi04G014660 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi04G014660
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionBSD domain-containing protein
Genome locationchr04:22384634..22388878
RNA-Seq ExpressionLsi04G014660
SyntenyLsi04G014660
Gene Ontology termsNA
InterPro domainsIPR005607 - BSD domain
IPR035925 - BSD domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649890.1 hypothetical protein Csa_011920 [Cucumis sativus]4.6e-11591.38Show/hide
Query:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFELKDSSDVLQE
        MSDAQKEHAFTIE LAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAE+LSTPQVAAARSMWMQELQKQTKPESYW   RDTFELKDSSDVLQE
Subjt:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFELKDSSDVLQE

Query:  DSSPMAFHDTHSGSTLPWTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDRNSTTGSSSKFLVQNYEDESDNDWLEEDSELGGCNGT
        D+SPMAFHDTHSGSTLPWTFT EPSMSS+SSNYET+KYQ+ES+ET FIDKSVIVEKP+IKN D+ ST GSSSK +VQNYE+ESDNDWLEEDSELGGCNGT
Subjt:  DSSPMAFHDTHSGSTLPWTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDRNSTTGSSSKFLVQNYEDESDNDWLEEDSELGGCNGT

Query:  ILPLENEEDISFSDLDDDDMVLPAKFKIASKE
        ILPLENEEDISFSDLDDDDMVLPAKFKIASKE
Subjt:  ILPLENEEDISFSDLDDDDMVLPAKFKIASKE

TYK20126.1 putative BSD domain-containing protein [Cucumis melo var. makuwa]6.4e-11792.67Show/hide
Query:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFELKDSSDVLQE
        MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAE+LSTPQVAAARSMWMQELQKQTKPESYW   RDTFELKD SDVLQE
Subjt:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFELKDSSDVLQE

Query:  DSSPMAFHDTHSGSTLPWTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDRNSTTGSSSKFLVQNYEDESDNDWLEEDSELGGCNGT
        DSSPMAFHDTHSGSTLPWTFT EPSMSS+SSNYET+KYQ+ES+ET FIDKSVIVEKP+IKNED+NST GSSSKFL +NYE+ESDNDWLEEDSELGGCNGT
Subjt:  DSSPMAFHDTHSGSTLPWTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDRNSTTGSSSKFLVQNYEDESDNDWLEEDSELGGCNGT

Query:  ILPLENEEDISFSDLDDDDMVLPAKFKIASKE
        ILPLENEEDISFSDLDDDDMVLPAKFKIASKE
Subjt:  ILPLENEEDISFSDLDDDDMVLPAKFKIASKE

XP_004141294.2 uncharacterized protein LOC101202841 isoform X1 [Cucumis sativus]4.6e-11591.38Show/hide
Query:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFELKDSSDVLQE
        MSDAQKEHAFTIE LAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAE+LSTPQVAAARSMWMQELQKQTKPESYW   RDTFELKDSSDVLQE
Subjt:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFELKDSSDVLQE

Query:  DSSPMAFHDTHSGSTLPWTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDRNSTTGSSSKFLVQNYEDESDNDWLEEDSELGGCNGT
        D+SPMAFHDTHSGSTLPWTFT EPSMSS+SSNYET+KYQ+ES+ET FIDKSVIVEKP+IKN D+ ST GSSSK +VQNYE+ESDNDWLEEDSELGGCNGT
Subjt:  DSSPMAFHDTHSGSTLPWTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDRNSTTGSSSKFLVQNYEDESDNDWLEEDSELGGCNGT

Query:  ILPLENEEDISFSDLDDDDMVLPAKFKIASKE
        ILPLENEEDISFSDLDDDDMVLPAKFKIASKE
Subjt:  ILPLENEEDISFSDLDDDDMVLPAKFKIASKE

XP_008452671.1 PREDICTED: uncharacterized protein LOC103493619 [Cucumis melo]6.4e-11792.67Show/hide
Query:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFELKDSSDVLQE
        MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAE+LSTPQVAAARSMWMQELQKQTKPESYW   RDTFELKD SDVLQE
Subjt:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFELKDSSDVLQE

Query:  DSSPMAFHDTHSGSTLPWTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDRNSTTGSSSKFLVQNYEDESDNDWLEEDSELGGCNGT
        DSSPMAFHDTHSGSTLPWTFT EPSMSS+SSNYET+KYQ+ES+ET FIDKSVIVEKP+IKNED+NST GSSSKFL +NYE+ESDNDWLEEDSELGGCNGT
Subjt:  DSSPMAFHDTHSGSTLPWTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDRNSTTGSSSKFLVQNYEDESDNDWLEEDSELGGCNGT

Query:  ILPLENEEDISFSDLDDDDMVLPAKFKIASKE
        ILPLENEEDISFSDLDDDDMVLPAKFKIASKE
Subjt:  ILPLENEEDISFSDLDDDDMVLPAKFKIASKE

XP_038897732.1 uncharacterized protein LOC120085672 [Benincasa hispida]7.8e-11591.81Show/hide
Query:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFELKDSSDVLQE
        MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQV AARSMWMQELQKQTKPESYWC  RDTFELKDSS VLQE
Subjt:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFELKDSSDVLQE

Query:  DSSPMAFHDTHSGSTLPWTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDRNSTTGSSSKFLVQNYEDESDNDWLEEDSELGGCNGT
         SSPMAFHDTHSGSTLPWTFT EPS+SS+SSNYET+KYQ ES+E  FIDKSVIVE+PIIKN+DRNS  GSSSKFLVQNYEDESDNDWLEEDS+LGGCNGT
Subjt:  DSSPMAFHDTHSGSTLPWTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDRNSTTGSSSKFLVQNYEDESDNDWLEEDSELGGCNGT

Query:  ILPLENEEDISFSDLDDDDMVLPAKFKIASKE
        ILPLENEEDISFSDL+DDDMVLPAKFKIASKE
Subjt:  ILPLENEEDISFSDLDDDDMVLPAKFKIASKE

TrEMBL top hitse value%identityAlignment
A0A0A0L3U8 BSD domain-containing protein2.2e-11591.38Show/hide
Query:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFELKDSSDVLQE
        MSDAQKEHAFTIE LAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAE+LSTPQVAAARSMWMQELQKQTKPESYW   RDTFELKDSSDVLQE
Subjt:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFELKDSSDVLQE

Query:  DSSPMAFHDTHSGSTLPWTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDRNSTTGSSSKFLVQNYEDESDNDWLEEDSELGGCNGT
        D+SPMAFHDTHSGSTLPWTFT EPSMSS+SSNYET+KYQ+ES+ET FIDKSVIVEKP+IKN D+ ST GSSSK +VQNYE+ESDNDWLEEDSELGGCNGT
Subjt:  DSSPMAFHDTHSGSTLPWTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDRNSTTGSSSKFLVQNYEDESDNDWLEEDSELGGCNGT

Query:  ILPLENEEDISFSDLDDDDMVLPAKFKIASKE
        ILPLENEEDISFSDLDDDDMVLPAKFKIASKE
Subjt:  ILPLENEEDISFSDLDDDDMVLPAKFKIASKE

A0A1S3BUE8 uncharacterized protein LOC1034936193.1e-11792.67Show/hide
Query:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFELKDSSDVLQE
        MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAE+LSTPQVAAARSMWMQELQKQTKPESYW   RDTFELKD SDVLQE
Subjt:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFELKDSSDVLQE

Query:  DSSPMAFHDTHSGSTLPWTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDRNSTTGSSSKFLVQNYEDESDNDWLEEDSELGGCNGT
        DSSPMAFHDTHSGSTLPWTFT EPSMSS+SSNYET+KYQ+ES+ET FIDKSVIVEKP+IKNED+NST GSSSKFL +NYE+ESDNDWLEEDSELGGCNGT
Subjt:  DSSPMAFHDTHSGSTLPWTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDRNSTTGSSSKFLVQNYEDESDNDWLEEDSELGGCNGT

Query:  ILPLENEEDISFSDLDDDDMVLPAKFKIASKE
        ILPLENEEDISFSDLDDDDMVLPAKFKIASKE
Subjt:  ILPLENEEDISFSDLDDDDMVLPAKFKIASKE

A0A5A7V7Z5 Putative BSD domain-containing protein3.1e-11792.67Show/hide
Query:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFELKDSSDVLQE
        MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAE+LSTPQVAAARSMWMQELQKQTKPESYW   RDTFELKD SDVLQE
Subjt:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFELKDSSDVLQE

Query:  DSSPMAFHDTHSGSTLPWTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDRNSTTGSSSKFLVQNYEDESDNDWLEEDSELGGCNGT
        DSSPMAFHDTHSGSTLPWTFT EPSMSS+SSNYET+KYQ+ES+ET FIDKSVIVEKP+IKNED+NST GSSSKFL +NYE+ESDNDWLEEDSELGGCNGT
Subjt:  DSSPMAFHDTHSGSTLPWTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDRNSTTGSSSKFLVQNYEDESDNDWLEEDSELGGCNGT

Query:  ILPLENEEDISFSDLDDDDMVLPAKFKIASKE
        ILPLENEEDISFSDLDDDDMVLPAKFKIASKE
Subjt:  ILPLENEEDISFSDLDDDDMVLPAKFKIASKE

A0A5D3D985 Putative BSD domain-containing protein3.1e-11792.67Show/hide
Query:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFELKDSSDVLQE
        MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAE+LSTPQVAAARSMWMQELQKQTKPESYW   RDTFELKD SDVLQE
Subjt:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFELKDSSDVLQE

Query:  DSSPMAFHDTHSGSTLPWTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDRNSTTGSSSKFLVQNYEDESDNDWLEEDSELGGCNGT
        DSSPMAFHDTHSGSTLPWTFT EPSMSS+SSNYET+KYQ+ES+ET FIDKSVIVEKP+IKNED+NST GSSSKFL +NYE+ESDNDWLEEDSELGGCNGT
Subjt:  DSSPMAFHDTHSGSTLPWTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDRNSTTGSSSKFLVQNYEDESDNDWLEEDSELGGCNGT

Query:  ILPLENEEDISFSDLDDDDMVLPAKFKIASKE
        ILPLENEEDISFSDLDDDDMVLPAKFKIASKE
Subjt:  ILPLENEEDISFSDLDDDDMVLPAKFKIASKE

A0A6J1CHA3 uncharacterized protein LOC1110111795.3e-10987.5Show/hide
Query:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFELKDSSDVLQE
        MSDAQKEHAFTIERLAPRLAALR ELCPCHMS+SYFWKVYFVLLHSRLNKHDAEVLSTPQV AARSMWMQELQKQTKPE+YWC  RDTFELKDSS VLQE
Subjt:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFELKDSSDVLQE

Query:  DSSPMAFHDTHSGSTLPWTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDRNSTTGSSSKFLVQNYEDESDNDWLEEDSELGGCNGT
            MAFHDTHSGSTLPWTFT EPS SS SSNYET+KY IES+ET FIDKSVIVEKP IKNEDR+S  GSS+KFLVQNYEDESDNDWLEEDSELGGCN  
Subjt:  DSSPMAFHDTHSGSTLPWTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDRNSTTGSSSKFLVQNYEDESDNDWLEEDSELGGCNGT

Query:  ILPLENEEDISFSDLDDDDMVLPAKFKIASKE
        ILPLEN+EDISFSDL+DDDMVLP+K KIASKE
Subjt:  ILPLENEEDISFSDLDDDDMVLPAKFKIASKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10720.1 BSD domain-containing protein2.1e-4145.8Show/hide
Query:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFELKDSSDVL--
        MSDAQ+ HA  IERLAPRLAALRIELCPCHMS  YFWKVYFVLL SRLNKHDA +LS+PQV  AR++WM+ELQ QT               K+S D++  
Subjt:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFELKDSSDVL--

Query:  QEDSSPMA---FHDTHSGSTLPWTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDRNSTTGSSSKFLVQNYEDESDNDWLEEDSELG
        +ED +P     ++        P  + +EP     S  Y   ++  E+ +  FIDK+VI EKPI KN+  +++   +SK +V    D+ D+DW EE+    
Subjt:  QEDSSPMA---FHDTHSGSTLPWTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDRNSTTGSSSKFLVQNYEDESDNDWLEEDSELG

Query:  GCNGTILPLENEEDISFSDLDDDDMV--LPAKFKIASK
          +   +   NE+D+SFSDL+ DD +  L  K KI SK
Subjt:  GCNGTILPLENEEDISFSDLDDDDMV--LPAKFKIASK

AT1G26300.1 BSD domain-containing protein7.0e-0532.14Show/hide
Query:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVL
        +SD Q+ HA  +     +++ LR ELCP  M E  FW++YF L+ + ++ ++ + +
Subjt:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVL

AT2G10950.1 BSD domain-containing protein7.3e-1037.78Show/hide
Query:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFE
        +S+AQ+ HA  IE L P L A++ ++   +M + +FW +YF+LL  RLN HD E+L+T +V   R   + +LQK+    S   +  +T E
Subjt:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFE

AT3G49800.1 BSD domain-containing protein1.2e-3338.82Show/hide
Query:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFELKDSSD--VL
        M+DAQ EHA  +E LA  LAALRIELCP +MSE  FW++YFVL+H   +KHDA  LSTPQV  +R++   EL ++          +DT  + +SSD    
Subjt:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFELKDSSD--VL

Query:  QEDSSPMAFHDTHSGSTLP---WTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDR---NSTTGSSSKFLVQNYEDESDNDWLEEDS
         E+  P+      S  + P    T T E   S+  S +ET+K+ +E+ E   +DK VI E+P     D+   +  TGSS + +    +D++D DWL+++ 
Subjt:  QEDSSPMAFHDTHSGSTLP---WTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDR---NSTTGSSSKFLVQNYEDESDNDWLEEDS

Query:  ELGGCNGTI--LPLENEEDISFSDLDDDDMVLPAKFK
          G  + T   L  + +ED+SFSDL++DD  +P  +K
Subjt:  ELGGCNGTI--LPLENEEDISFSDLDDDDMVLPAKFK

AT5G65910.1 BSD domain-containing protein1.9e-3442.36Show/hide
Query:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTK-PESYWCSRRDTFELKDSSDVLQ
        ++DAQ EHA  +ERLAP LA+LRIELCP +M+E+ FW++YFVL+H +L+K  A +LSTPQV  ARSM  QELQK++K P     S  +T        +++
Subjt:  MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTK-PESYWCSRRDTFELKDSSDVLQ

Query:  EDSSPMAFHDTHSGSTLPWTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDRNSTTGSSSKFLVQNYEDESD---NDWLEED-----
          + P +          P T   +      SS+ ETDK+ IES E   +DKSVI        E+R+++T SSS+F+    +DE D   +DWL ++     
Subjt:  EDSSPMAFHDTHSGSTLPWTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDRNSTTGSSSKFLVQNYEDESD---NDWLEED-----

Query:  SELGGCNGTILPL-ENEEDISFSDLDDDD
        S +GG + T  P  E+EED+SFSDL+++D
Subjt:  SELGGCNGTILPL-ENEEDISFSDLDDDD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGATGCCCAAAAAGAGCATGCTTTCACTATTGAACGTCTTGCTCCTAGATTAGCTGCTCTCAGAATTGAGCTCTGTCCTTGCCATATGAGTGAGAGTTACTTTTG
GAAAGTCTATTTTGTGCTCCTGCATTCAAGACTCAATAAGCATGATGCAGAGGTTTTATCTACTCCACAGGTAGCAGCAGCGAGATCAATGTGGATGCAGGAATTACAAA
AGCAGACCAAGCCAGAGTCTTACTGGTGTAGTAGAAGAGACACTTTTGAATTAAAAGACAGCTCCGATGTGCTGCAGGAAGACAGTAGTCCCATGGCATTCCATGATACT
CATTCTGGATCGACGCTGCCTTGGACATTTACATATGAGCCAAGCATGTCGTCTATGTCAAGCAATTATGAGACAGATAAATATCAGATAGAGAGTACTGAAACGCACTT
CATTGATAAGTCTGTTATTGTCGAAAAACCTATAATTAAGAATGAGGATAGAAACTCGACAACTGGGTCTTCTTCAAAATTCCTTGTTCAGAACTATGAGGATGAGTCAG
ACAATGACTGGCTAGAGGAAGACTCTGAGTTGGGTGGTTGTAATGGGACCATCCTCCCTCTGGAGAACGAAGAAGATATTTCTTTCAGTGATCTTGATGATGATGATATG
GTTCTGCCGGCTAAATTCAAGATTGCTTCAAAAGAATAG
mRNA sequenceShow/hide mRNA sequence
AAATAATTATAATAGGCGGAACCGATGAACTGACAAAGGCCATATCAAGGGAGGAAGTCTGAAGCAGTCACTTTCTTCACCATCCGAATCGGATACAATGTCATGGTTGG
CTCGCTCCATTGCCAACACCCTTCGCCTCGAAGACGAAGACGACGATCACAACGGCGTCGTTTCGCCCATTCCCTCCGATCCTCCTTCTCCCTCTACCACTCCACGCAAT
CAGATGGATTCCCAATCCGAACTCGACGACGAAGCATTATCTCGCGGTGTCAAAGAGGATTTGACTGAATTCAAACAAACCCTAACCCGCCAATTTTGGGGTGTCGCCTC
TTTCCTTGCTCCGCCGCCGCCGCCGCCGCCGCCCCATCCTTCCTTCACGTCTTCTCATCCCCGGCTGGGCGGGGATCTGGCTGCCCCTCCCGGTTGGATGCCGCTCGAGC
CGTCTAATCAATCTGATCCGTCGATCTCCGGGGACGAGGAGGACGAAGACGACCCATCCGATCCGGTCGAGGTCTTGAAGATGCGTTCGAATTATGACGCGTATGCGAAA
TCCGGGAATTTACAGGGGGAATGCTATGAGGAGGTGGATTGGGGAGATGCTGTTGGGATTACTGATGAAGTGCTGACGTTTGCGACGAACATTGCAATGCACCCTGAGAC
TTGGATTGATTTCCCAATTGACGAGGAGGAGGACAACGATGCTCCGGGGATGCTAGTAATTTAGCTTGGGGTACGCTGACTGCCGCAGTTAGATCACCACAGTTAGCACA
AAAAGCTAAAAGATGTATCCTAATTTAGCGACTATTTTTTTAAAATAGATTTGCTATCTTTATTGGCGCCGTAGTGGTCCTCCTAATACAATAGATGATGGGTATTAATT
TTTTTTGGATTTAACTTAGTAAGCCCTTTTGGACTTGAGGATAGTATTTTTGGGTCACGCCTCAGCTTTTATATTGCTTCGGTCACCTGACTTACTAAGACACAGGGGAT
TACTTAAAGCTGCATTATCATGTGCAGTGCAATCTAGAAATTGTAGACCTGCACTTTTATATCATTCCATATGATTTCCTGCTAAGGCATTCCGGTCCCCATTTTATCAC
CCTTTATCTGCTTCTTGATGTCTATTACTTTTCCTACTTTTGGTTTTGCTATGTTCAGAAGCTGTAATATTGTGGTCTCAGTGCTATACAGCACTTACGTTCACTTGTTC
CATTTACGTGTACTTTCTAAATCATTCATTTAATGTCCCAATGCACTTATTTTGTGATAGGAAGGATAAACCCCCTATTCAGTTCATTTACACTTGTAAAGCTTCATGAA
TCTAACTCTAAGTGAGTTTCATTATGTAACGTGCAGTTTGTCAAGTGTTGGAGTATCTCTCGTGTGTAGGTTGTAGAGCGCATGATCCTTTACGTGATGTGTGACCCCTT
TTATCTAATATGACTAGTAGCAGCATAGAACTCACATAGGTGAAGATGGTGAAGTGTTTGCTTTGAGGAGTTAGGCCTCTTCAAAAGATGGGAACTTGTTGGTTTTCCTT
CGTATATCAACATAATTAGCATATCATATATTGCATTTGAGGTTCTATCGATTATGTTCATCATTTATAGTGCATATATGGTGTTTCAGTCTTATACAATCAAGTATGTG
TGACCAAGTGTTTTAAACTTCAAGATTACACCTTATACTGAATTTAATCATAATTCCATCTCTTATATCTGGATATGTAAGATGAATGAGAAATAAATGATATATGGTGA
TCCACTTATATGATTATATGCTGACTTTTGAATGGTAAAATTGACCTTTGGTGTTGTTTAACTTTCTGTTTCTATGATGTCTTTTCACATTGTTGAACCCTGTATATTTA
CGTCTTTAGACCCTTCTTTTTCTTTTTTTAGATTTTGAAATGTCTGATGCCCAAAAAGAGCATGCTTTCACTATTGAACGTCTTGCTCCTAGATTAGCTGCTCTCAGAAT
TGAGCTCTGTCCTTGCCATATGAGTGAGAGTTACTTTTGGAAAGTCTATTTTGTGCTCCTGCATTCAAGACTCAATAAGCATGATGCAGAGGTTTTATCTACTCCACAGG
TAGCAGCAGCGAGATCAATGTGGATGCAGGAATTACAAAAGCAGACCAAGCCAGAGTCTTACTGGTGTAGTAGAAGAGACACTTTTGAATTAAAAGACAGCTCCGATGTG
CTGCAGGAAGACAGTAGTCCCATGGCATTCCATGATACTCATTCTGGATCGACGCTGCCTTGGACATTTACATATGAGCCAAGCATGTCGTCTATGTCAAGCAATTATGA
GACAGATAAATATCAGATAGAGAGTACTGAAACGCACTTCATTGATAAGTCTGTTATTGTCGAAAAACCTATAATTAAGAATGAGGATAGAAACTCGACAACTGGGTCTT
CTTCAAAATTCCTTGTTCAGAACTATGAGGATGAGTCAGACAATGACTGGCTAGAGGAAGACTCTGAGTTGGGTGGTTGTAATGGGACCATCCTCCCTCTGGAGAACGAA
GAAGATATTTCTTTCAGTGATCTTGATGATGATGATATGGTTCTGCCGGCTAAATTCAAGATTGCTTCAAAAGAATAGGGAAGTTCAACAAGAAGAACAAAAAATTCAAG
GTGTTGCTTCCGATTGTGTATGCTTTGCTGATGAGGCCAAATGAATTACAAAGTCAGTCTGACAAGATTGTTGGGTTTCCGATACGGATCCTATAAAACCATCTGAAATT
GCTTTTAGTGGGTTGGGTGCAATTGTAAATGATACGAATGATCATTCCAATTTGGAAGAGTGTATCATGATAACGTGTTTCTATTTGGTTTGTAGTTTGGGAATTGTGTA
AAGAATGTATCAGTAAATAGACAAAAAGGGAAAAGATACAAGATTCTTTTTGTTCAATACCATACCCAACGATGATTGTCCTCATCCTTTTGGTCTGTCAGGCATTCTCA
AATACTTAAAAAATCATATTCTGTCAGGAATGGAGATTCATCGGAAGCTGTCTGACAGAAGAAACCGCTAAAAACAGCCGAAGAAGCCACCGAATTATACAACTGAGGCA
GTTGAGAAAGAGGGTTCTGTTGGATTTCTATATTTGAATTTTGAAGTGAAGGGTGGCCTCATTTTGAAGGAAGTTGCGTCTCTGTCTTGTCCATAACAGGTATTTACAAA
ACGACCATAATGGACCATTCTTTTTTTCTCCTCTTATTTCATTGCTGTACTTTTACCTCTCTATTCCATGAAGATATATAGCTTTAGTTATCATATTCACAGATAGGACT
TTGTTCCAATCCTGTTGCCGTAGGAATGCAAGTTGCTCCATTATTTTAATATTTGAATATTAGCTAGTTTGTTTGTTAGTTTAAAATTGAACAGAAACGTACTGACCACA
ATCACTCTGGATGGGATTTGTTGTTGAGTGAGAAAGAAGACAAACAGGATAAAGAATTGTAGGATAATAGAAATGCAACAGCTACGGGAGGAGGAAAGTGAGATGAAAAT
ATATGGAAAAAAAGGGGTTTCTTTTTTCTTTTAAAAGGGAAATTATTGTCAGCCACTCACTGATTCAACAGTTAACGACGGTCTTGTTGTTGTGGAGGAAAGTAATTGAC
CCTGACTTTCCAAGAAATTGTTATCATATATAATATTATATAGAAGTTATAACCTCAATCATCGTTGAGTTATATTCATTTTGACG
Protein sequenceShow/hide protein sequence
MSDAQKEHAFTIERLAPRLAALRIELCPCHMSESYFWKVYFVLLHSRLNKHDAEVLSTPQVAAARSMWMQELQKQTKPESYWCSRRDTFELKDSSDVLQEDSSPMAFHDT
HSGSTLPWTFTYEPSMSSMSSNYETDKYQIESTETHFIDKSVIVEKPIIKNEDRNSTTGSSSKFLVQNYEDESDNDWLEEDSELGGCNGTILPLENEEDISFSDLDDDDM
VLPAKFKIASKE