; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000735 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000735
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr4:14576584..14586431
RNA-Seq ExpressionLag0000735
SyntenyLag0000735
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR015943 - WD40/YVTN repeat-like-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031737558.1 secreted RxLR effector protein 161-like [Cucumis sativus]9.4e-4960.61Show/hide
Query:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD
        + NA+ VT P+A HFKLS+ NSP   + EH   M NVPYSQ VGSLMYLM+STRPDLSY+ SL+S+YM N G+RHWEA KW++RYL  S  ARL Y+ + 
Subjt:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD

Query:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKFIALSEAIKRK
        +   E+  YVDSD+A D DKR SLTGY+FL G NL+SWK  LQ +VALS+TEA++IALSE +K +
Subjt:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKFIALSEAIKRK

XP_038877001.1 secreted RxLR effector protein 161-like [Benincasa hispida]1.0e-4762.58Show/hide
Query:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD
        M+ AK V TPL   FKLS+ANSPK+ +E+H+  M  VPYSQ VG LMYLM STR DLSYATSL+SRYM +P KR+WEA KW+LRYL  S  AR+ YK +D
Subjt:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD

Query:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKFIALSEAIK
           +E+Y Y+D+DYA DLD+R SL+GY+FL G NL++ K +LQ +VALS+TEAKFIALSE +K
Subjt:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKFIALSEAIK

XP_038877926.1 secreted RxLR effector protein 161-like [Benincasa hispida]5.5e-4962.58Show/hide
Query:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD
        M + K V+ P+ QHFKLS ANSPK  ++EH K M  +PYSQ VGSLMYLM+STRPDLSY+TSL+SRYM NPGK HW+A KW+LRYL     ++L Y+ S 
Subjt:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD

Query:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKFIALSEAIK
            E+Y Y+D+DYA DLDKR SLT YLFL G NL+SWK +LQP+VALS+TEA+++ L EAIK
Subjt:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKFIALSEAIK

XP_038885927.1 secreted RxLR effector protein 161-like [Benincasa hispida]1.9e-4955.73Show/hide
Query:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMI-----NPGKRHWEAAKWVLRYLKGSTGARLE
        ME AK V T  A H+K S+ N+PK+ +EEHL  M  V YSQ +GSLMYLM STRPDLSYATS++SRYM      NPG+RHWEA KW+LRYL  S  A++ 
Subjt:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMI-----NPGKRHWEAAKWVLRYLKGSTGARLE

Query:  YKSSDDCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKFIALSEAIKRKGEQEDEEQLIPSPKFVSRWRFE
        Y+ ++   +E+Y YV++DYA DLDKR SL+GY+FLLG NL+SWK TLQ VVALS+ EA+FIALSE +K K     + + I    +  R   E
Subjt:  YKSSDDCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKFIALSEAIKRKGEQEDEEQLIPSPKFVSRWRFE

XP_038895984.1 secreted RxLR effector protein 161-like [Benincasa hispida]3.8e-5064.42Show/hide
Query:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD
        M  AK+V TP+A HFKLS+ NSPK  + E+ K M+ +PYSQVVGSLMYLMVSTR D++YATSL+SRYM NPGKRHWEA KW++RYLKG+T A+L Y+   
Subjt:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD

Query:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKFIALSEAIK
        +   ++  YVDSDYA DLDKR SL+GY+FL G  +LS K TLQ +VALS+TE +FIALSEA+K
Subjt:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKFIALSEAIK

TrEMBL top hitse value%identityAlignment
A0A2P5W031 Uncharacterized protein4.4e-4457.67Show/hide
Query:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD
        M+ AK V+TPLA HFKLS+  SP+++EE+  + M+++PYS  VGS+MY MV TRPD+S+A S++SRYM  PGK HW+A KW+LRYL+GS    L Y  S 
Subjt:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD

Query:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKFIALSEAIK
        DC + +  YVDSDYA DLDKR SLTGY+F   G  +SWK  LQ  VALS+TEA+++AL+EA+K
Subjt:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKFIALSEAIK

A0A2P5YYC3 Uncharacterized protein4.4e-4457.67Show/hide
Query:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD
        M+ AK V+TPLA HFKLS+  SP+++EE+  + M+++PYS  VGS+MY MV TRPD+S+A S++SRYM  PGK HW+A KW+LRYL+GS    L Y  S 
Subjt:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD

Query:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKFIALSEAIK
        DC + +  YVDSDYA DLDKR SLTGY+F   G  +SWK  LQ  VALS+TEA+++AL+EA+K
Subjt:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKFIALSEAIK

A0A5A7U2U7 Retrotransposon protein, putative, Ty1-copia sub-class9.8e-4457.06Show/hide
Query:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD
        M ++KAV+TPLA HF+LSS+  P  ++E     M+N+PY   VGS+MYLM+ TRPDL YA S+ISR+M NPGK HW+A KWVLRYLKGS    L Y    
Subjt:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD

Query:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKFIALSEAIK
        D    +  + D+DYA+DLDKR SL+G++F L GN++SWKV LQPVVALS+TE+++I+L EA+K
Subjt:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKFIALSEAIK

A0A5A7UB25 Putative gag-pol polyprotein9.8e-4457.06Show/hide
Query:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD
        M ++KAV+TPLA HF+LSS+  P  ++E     M+N+PY   VGS+MYLM+ TRPDL YA S+ISR+M NPGK HW+A KWVLRYLKGS    L Y    
Subjt:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD

Query:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKFIALSEAIK
        D    +  + D+DYA+DLDKR SL+G++F L GN++SWKV LQPVVALS+TE+++I+L EA+K
Subjt:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKFIALSEAIK

A0A5N6PAJ0 Integrase catalytic domain-containing protein5.8e-4457.06Show/hide
Query:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD
        MEN+K V TPL  HFKLS+A+SP   +E  + +M  VPY+  VGSLMYLMV +RPD+ +A SL+SRY+ NPGK HW+A KW+LRYL G+    L Y S  
Subjt:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD

Query:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKFIALSEAIK
        +  N++Y +VDSD+A DLDK  S+TGY F + GN++SWK  LQ VVALS+TEA++IAL+EA+K
Subjt:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKFIALSEAIK

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.0e-2341.46Show/hide
Query:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD
        MEN  AV+TPL         NS    +E+      N P   ++G LMY+M+ TRPDL+ A +++SRY        W+  K VLRYLKG+   +L +K + 
Subjt:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD

Query:  DCPNEIYEYVDSDYASDLDKRISLTGYLF-LLGGNLLSWKVTLQPVVALSSTEAKFIALSEAIK
           N+I  YVDSD+A     R S TGYLF +   NL+ W    Q  VA SSTEA+++AL EA++
Subjt:  DCPNEIYEYVDSDYASDLDKRISLTGYLF-LLGGNLLSWKVTLQPVVALSSTEAKFIALSEAIK

P0CV72 Secreted RxLR effector protein 1612.4e-2346.09Show/hide
Query:  MTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSDDCPNEIYEYVDSDYASDLDKRISLTGYLFLLGG
        M NVPY   VG++MYLMV TRPDL+ A  ++S++  +P   HW+A K VLRYL+ +    LE+  +     ++  Y D+D+A D++ R S +GYLF L G
Subjt:  MTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSDDCPNEIYEYVDSDYASDLDKRISLTGYLFLLGG

Query:  NLLSWKVTLQPVVALSSTEAKFIALSEA
          +SW+   Q  VALSSTE +++ALSEA
Subjt:  NLLSWKVTLQPVVALSSTEAKFIALSEA

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.0e-3752.15Show/hide
Query:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD
        M+NAK V+TPLA H KLS    P   EE+    M  VPYS  VGSLMY MV TRPD+++A  ++SR++ NPGK HWEA KW+LRYL+G+TG  L +  SD
Subjt:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD

Query:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKFIALSEAIK
             +  Y D+D A D+D R S TGYLF   G  +SW+  LQ  VALS+TEA++IA +E  K
Subjt:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKFIALSEAIK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.3e-1637.42Show/hide
Query:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD
        M  AK VTTP+A         SPK       K      Y  +VGSL YL   TRPD+SYA + +S++M  P + H +A K +LRYL G+    +  K  +
Subjt:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD

Query:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKF
             ++ Y D+D+A D D  +S  GY+  LG + +SW    Q  V  SSTEA++
Subjt:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.9e-1839.35Show/hide
Query:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD
        M  AK V TP+        A SPK       K      Y  +VGSL YL   TRPDLSYA + +S+YM  P   HW A K VLRYL G+    +  K  +
Subjt:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD

Query:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKF
             ++ Y D+D+A D D  +S  GY+  LG + +SW    Q  V  SSTEA++
Subjt:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKF

Arabidopsis top hitse value%identityAlignment
AT2G37160.1 Transducin/WD40 repeat-like superfamily protein8.8e-2174.19Show/hide
Query:  EDDFVKVWSIEDRKVMAWGEGHNSWVSGVAFDSYWSYHPASDDTEENVVYRFGSVGQVLQVV
        EDD V+VWS+EDRKV+AWGEGHNSWVSGVAFDSYWS  P S+ + ENV+YRFGSVGQ  Q++
Subjt:  EDDFVKVWSIEDRKVMAWGEGHNSWVSGVAFDSYWSYHPASDDTEENVVYRFGSVGQVLQVV

AT2G37160.2 Transducin/WD40 repeat-like superfamily protein8.8e-2174.19Show/hide
Query:  EDDFVKVWSIEDRKVMAWGEGHNSWVSGVAFDSYWSYHPASDDTEENVVYRFGSVGQVLQVV
        EDD V+VWS+EDRKV+AWGEGHNSWVSGVAFDSYWS  P S+ + ENV+YRFGSVGQ  Q++
Subjt:  EDDFVKVWSIEDRKVMAWGEGHNSWVSGVAFDSYWSYHPASDDTEENVVYRFGSVGQVLQVV

AT3G53390.1 Transducin/WD40 repeat-like superfamily protein1.3e-1972.58Show/hide
Query:  EDDFVKVWSIEDRKVMAWGEGHNSWVSGVAFDSYWSYHPASDDTEENVVYRFGSVGQVLQVV
        EDD V+VWS+EDRKV+AWGEGHNSWVSGVAFDS WS  P SD + E+V+YRFGSVGQ  Q++
Subjt:  EDDFVKVWSIEDRKVMAWGEGHNSWVSGVAFDSYWSYHPASDDTEENVVYRFGSVGQVLQVV

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.2e-1537.21Show/hide
Query:  FMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSDDCPNEIYEYVDSDYASDLDKRISLTGYLFLLG
        F+    Y +++G LMYL + TR D+S+A + +S++   P   H +A   +L Y+KG+ G  L Y S  +   +++   D+ + S  D R S  GY   LG
Subjt:  FMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSDDCPNEIYEYVDSDYASDLDKRISLTGYLFLLG

Query:  GNLLSWKVTLQPVVALSSTEAKFIALSEA
         +L+SWK   Q VV+ SS EA++ ALS A
Subjt:  GNLLSWKVTLQPVVALSSTEAKFIALSEA

ATMG00810.1 DNA/RNA polymerases superfamily protein6.1e-1433.96Show/hide
Query:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD
        M + K ++TPL    KL+S+ S         K+     +  +VG+L YL + TRPD+SYA +++ + M  P    ++  K VLRY+KG+    L    + 
Subjt:  MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSD

Query:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKFIALS
             +  + DSD+A     R S TG+   LG N++SW    QP V+ SSTE ++ AL+
Subjt:  DCPNEIYEYVDSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKFIALS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAATGCTAAAGCGGTTACAACTCCATTGGCTCAACACTTCAAGTTATCTTCTGCTAATTCACCTAAGGCAGAAGAAGAAGAGCACTTGAAGTTTATGACAAATGT
ACCTTATTCTCAAGTTGTTGGAAGCCTTATGTATTTAATGGTTTCTACAAGGCCAGATTTGTCATATGCCACGAGTTTAATCAGCAGATACATGATAAATCCAGGTAAAA
GACATTGGGAAGCTGCAAAATGGGTATTGAGATACCTTAAAGGGTCTACTGGAGCTAGACTTGAATATAAAAGCAGTGATGATTGTCCTAATGAGATCTACGAATATGTG
GACTCAGATTATGCTAGTGATTTGGACAAAAGAATATCATTGACAGGGTATTTGTTCCTTCTTGGAGGAAATCTTCTAAGTTGGAAAGTAACCCTTCAACCAGTGGTAGC
TCTATCTTCAACTGAAGCAAAATTTATTGCTTTATCAGAAGCTATAAAAAGAAAAGGTGAACAAGAAGATGAGGAACAGCTCATTCCTTCGCCTAAATTTGTTTCAAGGT
GGAGATTTGAAGGTTATGTCTCCAACTCTGAAGTGTGGCCCGAAAAAATAATGTGCACGAGGTACTCTATTCTTCAAACTAAGCCAATTGAGATGCCCTTTGTAGTTCCT
TCACTTGGGGTTTTTGCTGTGAGTTTTGCCTTCAATTTTGGTTCAGGGATCTTTCTTGTGGAAATGTTAAGTAACTTCCATTTTTGTGGAAGGGTCATTCGTTGGCAGGG
TAGAATTTTCTTACAATGTAACACAAGGCTCTTGAATGATAAAAGAAATACTCGGGAAGACGATTTTGTCAAAGTATGGAGCATAGAAGATAGGAAGGTGATGGCATGGG
GCGAGGGACATAATTCATGGGTTAGTGGTGTAGCTTTTGATTCATATTGGTCATATCACCCAGCTTCGGATGACACAGAGGAAAATGTTGTTTACCGATTTGGTTCAGTT
GGTCAGGTTCTTCAAGTTGTTGAACATATGTTTGTTATGTTCTTCAAGATCGATGTACACGAGCAAGATTCTGCCGAAAAGAATGGAAGTTGCTGCATGGTCAGCGTCTC
GACACTGACCCCTCTTCCAGCCTTGCTTCCTTTTTTCTCAATCTTTCTCGAAACCACCACCACTGCAGCCTGTGAAGGTAAAATCATCCACGCCTCGGGTAAAGAAAACG
CCAACCAAGAAACCAAGGGTGGAACGACAGGTCCACTCCTTGCTTTTATTGATGACATTGTGCGTAGGTATGGTTGGGACCGTCTTCACAAGGGACTTAAAGCAGCCTCC
GTCCCCCTGATTAATCAAAATTTTCGATTACCATCCCTCCGACAACCAGGAGGAAATATTGCTATAAAAGAGCCAACAGATGAGCAACTAGCTGAGGCCCTTTCCTTAGT
GGCCCACGAAGGAGTGAAGTTGAAAGAGTCCTTGACGGGGATCAGGACTCTGTTTTGTAGAGATTTGAAGCCAGAGGCGTCAGTCTGGCTTTATGTGATCAAGAACAAGA
TCATGCCTACGAAGCATGACACGACGATCTCACAAGAACATGTCATGTTGTTATACAACATGTATAAAGGCTTGGATCTCAACCTTAACCTAATTCTTTGGAAAGAGATC
CTTTCCTATGGGCATAAGAACGTGGGGGAGGCTCTTCTTCCTAAGCTTGATCACCAAAATCTGCAGGCTTGCAGTGTTGGGTTTTATGTCCTAAAACTCAAAGTGACCCC
TCCAGAAAAGAAGGCACAGCAGACCCCTCAAACAACAATGCCTCCAGTAGAACCTTCCACAGCGGCAGATGCAGCCGCAAATCAACCGCAAATTCCTGATCCTCCGCAGC
AAGCCCCTCAGCCACAGCCTGAAGATAATTTAGTGGCGTTGATAAGAGATTTAGAGCATAAGCAGCGCAATGCAAGAACTACATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAATGCTAAAGCGGTTACAACTCCATTGGCTCAACACTTCAAGTTATCTTCTGCTAATTCACCTAAGGCAGAAGAAGAAGAGCACTTGAAGTTTATGACAAATGT
ACCTTATTCTCAAGTTGTTGGAAGCCTTATGTATTTAATGGTTTCTACAAGGCCAGATTTGTCATATGCCACGAGTTTAATCAGCAGATACATGATAAATCCAGGTAAAA
GACATTGGGAAGCTGCAAAATGGGTATTGAGATACCTTAAAGGGTCTACTGGAGCTAGACTTGAATATAAAAGCAGTGATGATTGTCCTAATGAGATCTACGAATATGTG
GACTCAGATTATGCTAGTGATTTGGACAAAAGAATATCATTGACAGGGTATTTGTTCCTTCTTGGAGGAAATCTTCTAAGTTGGAAAGTAACCCTTCAACCAGTGGTAGC
TCTATCTTCAACTGAAGCAAAATTTATTGCTTTATCAGAAGCTATAAAAAGAAAAGGTGAACAAGAAGATGAGGAACAGCTCATTCCTTCGCCTAAATTTGTTTCAAGGT
GGAGATTTGAAGGTTATGTCTCCAACTCTGAAGTGTGGCCCGAAAAAATAATGTGCACGAGGTACTCTATTCTTCAAACTAAGCCAATTGAGATGCCCTTTGTAGTTCCT
TCACTTGGGGTTTTTGCTGTGAGTTTTGCCTTCAATTTTGGTTCAGGGATCTTTCTTGTGGAAATGTTAAGTAACTTCCATTTTTGTGGAAGGGTCATTCGTTGGCAGGG
TAGAATTTTCTTACAATGTAACACAAGGCTCTTGAATGATAAAAGAAATACTCGGGAAGACGATTTTGTCAAAGTATGGAGCATAGAAGATAGGAAGGTGATGGCATGGG
GCGAGGGACATAATTCATGGGTTAGTGGTGTAGCTTTTGATTCATATTGGTCATATCACCCAGCTTCGGATGACACAGAGGAAAATGTTGTTTACCGATTTGGTTCAGTT
GGTCAGGTTCTTCAAGTTGTTGAACATATGTTTGTTATGTTCTTCAAGATCGATGTACACGAGCAAGATTCTGCCGAAAAGAATGGAAGTTGCTGCATGGTCAGCGTCTC
GACACTGACCCCTCTTCCAGCCTTGCTTCCTTTTTTCTCAATCTTTCTCGAAACCACCACCACTGCAGCCTGTGAAGGTAAAATCATCCACGCCTCGGGTAAAGAAAACG
CCAACCAAGAAACCAAGGGTGGAACGACAGGTCCACTCCTTGCTTTTATTGATGACATTGTGCGTAGGTATGGTTGGGACCGTCTTCACAAGGGACTTAAAGCAGCCTCC
GTCCCCCTGATTAATCAAAATTTTCGATTACCATCCCTCCGACAACCAGGAGGAAATATTGCTATAAAAGAGCCAACAGATGAGCAACTAGCTGAGGCCCTTTCCTTAGT
GGCCCACGAAGGAGTGAAGTTGAAAGAGTCCTTGACGGGGATCAGGACTCTGTTTTGTAGAGATTTGAAGCCAGAGGCGTCAGTCTGGCTTTATGTGATCAAGAACAAGA
TCATGCCTACGAAGCATGACACGACGATCTCACAAGAACATGTCATGTTGTTATACAACATGTATAAAGGCTTGGATCTCAACCTTAACCTAATTCTTTGGAAAGAGATC
CTTTCCTATGGGCATAAGAACGTGGGGGAGGCTCTTCTTCCTAAGCTTGATCACCAAAATCTGCAGGCTTGCAGTGTTGGGTTTTATGTCCTAAAACTCAAAGTGACCCC
TCCAGAAAAGAAGGCACAGCAGACCCCTCAAACAACAATGCCTCCAGTAGAACCTTCCACAGCGGCAGATGCAGCCGCAAATCAACCGCAAATTCCTGATCCTCCGCAGC
AAGCCCCTCAGCCACAGCCTGAAGATAATTTAGTGGCGTTGATAAGAGATTTAGAGCATAAGCAGCGCAATGCAAGAACTACATGA
Protein sequenceShow/hide protein sequence
MENAKAVTTPLAQHFKLSSANSPKAEEEEHLKFMTNVPYSQVVGSLMYLMVSTRPDLSYATSLISRYMINPGKRHWEAAKWVLRYLKGSTGARLEYKSSDDCPNEIYEYV
DSDYASDLDKRISLTGYLFLLGGNLLSWKVTLQPVVALSSTEAKFIALSEAIKRKGEQEDEEQLIPSPKFVSRWRFEGYVSNSEVWPEKIMCTRYSILQTKPIEMPFVVP
SLGVFAVSFAFNFGSGIFLVEMLSNFHFCGRVIRWQGRIFLQCNTRLLNDKRNTREDDFVKVWSIEDRKVMAWGEGHNSWVSGVAFDSYWSYHPASDDTEENVVYRFGSV
GQVLQVVEHMFVMFFKIDVHEQDSAEKNGSCCMVSVSTLTPLPALLPFFSIFLETTTTAACEGKIIHASGKENANQETKGGTTGPLLAFIDDIVRRYGWDRLHKGLKAAS
VPLINQNFRLPSLRQPGGNIAIKEPTDEQLAEALSLVAHEGVKLKESLTGIRTLFCRDLKPEASVWLYVIKNKIMPTKHDTTISQEHVMLLYNMYKGLDLNLNLILWKEI
LSYGHKNVGEALLPKLDHQNLQACSVGFYVLKLKVTPPEKKAQQTPQTTMPPVEPSTAADAAANQPQIPDPPQQAPQPQPEDNLVALIRDLEHKQRNARTT