; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007612 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007612
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr9:1954236..1958121
RNA-Seq ExpressionLag0007612
SyntenyLag0007612
Gene Ontology termsGO:0034641 - cellular nitrogen compound metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016491 - oxidoreductase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]3.3e-5351.81Show/hide
Query:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK
        I+ F+SN  + HW+A KW++RYLKG+++  L Y++    +  + GF D+DYA DLDKRRSL+G++F ++GN++SWK +LQ VVALS+TE EYI+L E VK
Subjt:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK

Query:  EAVWLKGNVSEMLNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFEYLCDLLKL
        EAVWLK  V E+L+      I+CD+QSA+ L+KNP  H+R+KHIDV+FHY+R  + +  V+L K+HT  N +D++TK L+A+RF+YL D L +
Subjt:  EAVWLKGNVSEMLNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFEYLCDLLKL

KAE8700517.1 hypothetical protein F3Y22_tig00110556pilonHSYRG00215 [Hibiscus syriacus]5.1e-5450.5Show/hide
Query:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK
        I+ F++N  + HWEA KW +RYL+GT+  GL++ K ++  E +VG+VDSDYA  +D R+SLTG++F+VFG  +SWKS+LQSVVALS+TE EYIA+ E +K
Subjt:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK

Query:  EAVWLKGNVSEMLNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFEYLCDLLKLDHRVSKV
        EA+WL+G V E+      + ++CDNQS + L++N  FH+R+KHIDV+ H+VR+ V +GS+ + KI T  NPAD++TK L A +F +  DL +L    +KV
Subjt:  EAVWLKGNVSEMLNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFEYLCDLLKLDHRVSKV

KAG7962752.1 hypothetical protein I3843_09G081300 [Carya illinoinensis]9.6e-5352.31Show/hide
Query:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK
        I+ ++ N  + HW AAKWI+RY+ GT + GL + KS+ S   + G+VDSDYA DLDKRRS TGYVF++ G  +SW+S+LQS +ALSSTE EY+A+ E VK
Subjt:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK

Query:  EAVWLKGNVSEMLNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFEYLCDLLKLDH
        EA+WL+G V+++     EV +YCD+QSA+ L+KN  +H RTKHIDVRFH+VRE ++EG + L KI T  NPAD++TK +   +F++  DL+ + +
Subjt:  EAVWLKGNVSEMLNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFEYLCDLLKLDH

KZV43791.1 hypothetical protein F511_41481 [Dorcoceras hygrometricum]3.3e-5352.58Show/hide
Query:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK
        ++ F++N  + HW+A KW++RYLKGT N GL+Y  + +  EA++G+VDSDYA  +D RRSLTGYVF+V+G  +SWK++LQSVVALS+TE EYIA+ E VK
Subjt:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK

Query:  EAVWLKGNVSEMLNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFEYLCDLLKLD
        EA+WLKG  SE+      + + CD+QSA+ L+KN  FH+RTKHIDV+ H+VRE V  G V + K+ T  N AD++TK L +N+F +   LL+++
Subjt:  EAVWLKGNVSEMLNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFEYLCDLLKLD

TYK13826.1 putative polyprotein [Cucumis melo var. makuwa]3.3e-5351.81Show/hide
Query:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK
        I+ F+SN  + HW+A KW++RYLKG+++  L Y++    +  + GF D+DYA DLDKRRSL+G++F ++GN++SWK +LQ VVALS+TE EYI+L E VK
Subjt:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK

Query:  EAVWLKGNVSEMLNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFEYLCDLLKL
        EAVWLK  V E+L+      I+CD+QSA+ L+KNP  H+R+KHIDV+FHY+R  + +  V+L K+HT  N +D++TK L+A+RF+YL D L +
Subjt:  EAVWLKGNVSEMLNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFEYLCDLLKL

TrEMBL top hitse value%identityAlignment
A0A1J3DQ82 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment)1.2e-5355.85Show/hide
Query:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK
        I+ F+SN  R HW A KW++RYL+G+S   L +TK  S   +I GF DSDYA DLD+RRS+TG++F V+GN +SW+S+LQSVVALS+TE EY+AL+  VK
Subjt:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK

Query:  EAVWLKGNVSEMLNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFEYLC
        EA+WLKG  SE+      V+IYCD+QSAL+L+KN  +H+RTKHI  ++H++R+ V +G+V+L KIHT+ NPAD +TK L   +FE LC
Subjt:  EAVWLKGNVSEMLNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFEYLC

A0A2Z7C9U2 Uncharacterized protein1.6e-5352.58Show/hide
Query:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK
        ++ F++N  + HW+A KW++RYLKGT N GL+Y  + +  EA++G+VDSDYA  +D RRSLTGYVF+V+G  +SWK++LQSVVALS+TE EYIA+ E VK
Subjt:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK

Query:  EAVWLKGNVSEMLNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFEYLCDLLKLD
        EA+WLKG  SE+      + + CD+QSA+ L+KN  FH+RTKHIDV+ H+VRE V  G V + K+ T  N AD++TK L +N+F +   LL+++
Subjt:  EAVWLKGNVSEMLNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFEYLCDLLKLD

A0A5A7UB25 Putative gag-pol polyprotein1.6e-5351.81Show/hide
Query:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK
        I+ F+SN  + HW+A KW++RYLKG+++  L Y++    +  + GF D+DYA DLDKRRSL+G++F ++GN++SWK +LQ VVALS+TE EYI+L E VK
Subjt:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK

Query:  EAVWLKGNVSEMLNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFEYLCDLLKL
        EAVWLK  V E+L+      I+CD+QSA+ L+KNP  H+R+KHIDV+FHY+R  + +  V+L K+HT  N +D++TK L+A+RF+YL D L +
Subjt:  EAVWLKGNVSEMLNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFEYLCDLLKL

A0A5D3CTV2 Putative polyprotein1.6e-5351.81Show/hide
Query:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK
        I+ F+SN  + HW+A KW++RYLKG+++  L Y++    +  + GF D+DYA DLDKRRSL+G++F ++GN++SWK +LQ VVALS+TE EYI+L E VK
Subjt:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK

Query:  EAVWLKGNVSEMLNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFEYLCDLLKL
        EAVWLK  V E+L+      I+CD+QSA+ L+KNP  H+R+KHIDV+FHY+R  + +  V+L K+HT  N +D++TK L+A+RF+YL D L +
Subjt:  EAVWLKGNVSEMLNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFEYLCDLLKL

A0A6A3A9V0 Uncharacterized protein2.5e-5450.5Show/hide
Query:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK
        I+ F++N  + HWEA KW +RYL+GT+  GL++ K ++  E +VG+VDSDYA  +D R+SLTG++F+VFG  +SWKS+LQSVVALS+TE EYIA+ E +K
Subjt:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK

Query:  EAVWLKGNVSEMLNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFEYLCDLLKLDHRVSKV
        EA+WL+G V E+      + ++CDNQS + L++N  FH+R+KHIDV+ H+VR+ V +GS+ + KI T  NPAD++TK L A +F +  DL +L    +KV
Subjt:  EAVWLKGNVSEMLNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFEYLCDLLKLDHRVSKV

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.6e-3441Show/hide
Query:  TRTRWIAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFG-NIISWKSSLQSVVALSSTEVEYIA
        T    ++ + S      W+  K ++RYLKGT +  L++ K+ +    I+G+VDSD+A     R+S TGY+F +F  N+I W +  Q+ VA SSTE EY+A
Subjt:  TRTRWIAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFG-NIISWKSSLQSVVALSSTEVEYIA

Query:  LAECVKEAVWLKGNVSEM-LNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFEYLCDLLKL
        L E V+EA+WLK  ++ + +     ++IY DNQ  +S++ NP  H R KHID+++H+ RE VQ   + L  I T +  ADI TK L A RF  L D L L
Subjt:  LAECVKEAVWLKGNVSEM-LNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFEYLCDLLKL

P0CV72 Secreted RxLR effector protein 1618.6e-2045.71Show/hide
Query:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK
        ++ F S+    HW+A K ++RYL+ T   GL +T+  + T  +VG+ D+D+A D++ RRS +GY+F + G  +SW+S  Q  VALSSTE EY+AL+E  +
Subjt:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK

Query:  EAVWL
        EAVWL
Subjt:  EAVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.1e-4647.64Show/hide
Query:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK
        ++ F+ N  + HWEA KWI+RYL+GT+   L +  SD     + G+ D+D A D+D R+S TGY+F+  G  ISW+S LQ  VALS+TE EYIA  E  K
Subjt:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK

Query:  EAVWLKGNVSEMLNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFEYLCDLL
        E +WLK  + E+     E  +YCD+QSA+ LSKN  +H RTKHIDVR+H++RE V + S+++ KI T  NPAD++TK +  N+FE   +L+
Subjt:  EAVWLKGNVSEMLNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFEYLCDLL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-2434.95Show/hide
Query:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK
        ++ F+      H +A K I+RYL GT N G+   K   +T ++  + D+D+A D D   S  GY+  +  + ISW S  Q  V  SSTE EY ++A    
Subjt:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK

Query:  EAVWLKGNVSEM-LNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFE
        E  W+   ++E+ +       IYCDN  A  L  NP FH R KHI + +H++R  VQ G++++  + T    AD +TK L+   F+
Subjt:  EAVWLKGNVSEM-LNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.5e-2436Show/hide
Query:  HWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVKEAVWLKGNVSE
        HW A K ++RYL GT + G+   K   +T ++  + D+D+A D D   S  GY+  +  + ISW S  Q  V  SSTE EY ++A    E  W+   ++E
Subjt:  HWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVKEAVWLKGNVSE

Query:  M-LNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFE
        + +       IYCDN  A  L  NP FH R KHI + +H++R  VQ G++++  + T    AD +TK L+   F+
Subjt:  M-LNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFE

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.1e-1936.36Show/hide
Query:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK
        ++ F    +  H +A   I+ Y+KGT  +GL Y  S  +   +  F D+ +    D RRS  GY   +  ++ISWKS  Q VV+ SS E EY AL+    
Subjt:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVK

Query:  EAVWLKGNVSEM-LNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVRE
        E +WL     E+ L       ++CDN +A+ ++ N  FH+RTKHI+   H VRE
Subjt:  EAVWLKGNVSEM-LNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVRE

ATMG00240.1 Gag-Pol-related retrotransposon family protein6.7e-0435.94Show/hide
Query:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGY
        ++ F S ++    +A   ++ Y+KGT  +GL Y  S +S   +  F DSD+A   D RRS+TG+
Subjt:  IAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCCTACACGAACGCGTTGGATCGCAGCGTTTGTATCAAATACAAAAAGACCTCACTGGGAAGCTGCGAAATGGATTGTAAGGTACTTAAAAGGAACTTCTAACAG
AGGACTATTGTATACCAAATCTGATTCCAGTACTGAAGCAATAGTTGGCTTTGTGGACTCGGATTATGCAGTCGACTTAGATAAAAGACGTTCATTGACTGGATATGTGT
TTTCTGTTTTTGGTAATATAATAAGCTGGAAGTCCTCTTTGCAATCGGTAGTTGCCTTGTCCTCAACAGAAGTCGAATACATAGCCCTAGCAGAATGTGTAAAAGAAGCT
GTATGGTTGAAGGGAAATGTCTCTGAGATGTTGAATGCAGTGGCTGAGGTTCGGATTTACTGTGACAACCAGAGTGCGTTGTCTTTGTCCAAAAATCCTTTCTTTCATGA
TAGAACCAAACACATTGATGTGAGGTTCCATTATGTGAGGGAAGCTGTTCAAGAGGGTTCTGTCCAGCTATCTAAAATCCATACCACTCATAATCCAGCTGATATTATGA
CTAAGACTCTAGCAGCAAACCGTTTTGAATATCTCTGTGACCTGTTGAAATTGGACCACAGAGTAAGCAAGGTAATTTTGGACCACCATAGTGTACGAGGATCTGACGAG
GACAACCTGGTGGAGATGGGACCAGGAAACGACCCAGAGGAAAACCAAACCAACGGGTTGGGCCAACGTGGCCCGACCCGTAATGTCGGCCTCGGCCTTGGGCCGAGGCC
GACCACTCGACTCGCTTGCGCGGCCGAGTCCGTTTGCCTCCGCTCGGTCCCTACCGCCTCTGGCCGCCCCGGTTCCGCTTGGCATCGGAGGCGGTGTGGCCTACACCACA
CCGGTGTGCAGCGGTTTTTACTGGTCTTGCAGGTCACGTCTTCCCCAACTTCTACAAATTCACTGTTGGTGTCACGTGAAGGTCAGGAATTTAAACGTGCCCTTAACAAG
GATGAAAATCATGGTAAGGAAATTGGAAGAAAACAAACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCCTACACGAACGCGTTGGATCGCAGCGTTTGTATCAAATACAAAAAGACCTCACTGGGAAGCTGCGAAATGGATTGTAAGGTACTTAAAAGGAACTTCTAACAG
AGGACTATTGTATACCAAATCTGATTCCAGTACTGAAGCAATAGTTGGCTTTGTGGACTCGGATTATGCAGTCGACTTAGATAAAAGACGTTCATTGACTGGATATGTGT
TTTCTGTTTTTGGTAATATAATAAGCTGGAAGTCCTCTTTGCAATCGGTAGTTGCCTTGTCCTCAACAGAAGTCGAATACATAGCCCTAGCAGAATGTGTAAAAGAAGCT
GTATGGTTGAAGGGAAATGTCTCTGAGATGTTGAATGCAGTGGCTGAGGTTCGGATTTACTGTGACAACCAGAGTGCGTTGTCTTTGTCCAAAAATCCTTTCTTTCATGA
TAGAACCAAACACATTGATGTGAGGTTCCATTATGTGAGGGAAGCTGTTCAAGAGGGTTCTGTCCAGCTATCTAAAATCCATACCACTCATAATCCAGCTGATATTATGA
CTAAGACTCTAGCAGCAAACCGTTTTGAATATCTCTGTGACCTGTTGAAATTGGACCACAGAGTAAGCAAGGTAATTTTGGACCACCATAGTGTACGAGGATCTGACGAG
GACAACCTGGTGGAGATGGGACCAGGAAACGACCCAGAGGAAAACCAAACCAACGGGTTGGGCCAACGTGGCCCGACCCGTAATGTCGGCCTCGGCCTTGGGCCGAGGCC
GACCACTCGACTCGCTTGCGCGGCCGAGTCCGTTTGCCTCCGCTCGGTCCCTACCGCCTCTGGCCGCCCCGGTTCCGCTTGGCATCGGAGGCGGTGTGGCCTACACCACA
CCGGTGTGCAGCGGTTTTTACTGGTCTTGCAGGTCACGTCTTCCCCAACTTCTACAAATTCACTGTTGGTGTCACGTGAAGGTCAGGAATTTAAACGTGCCCTTAACAAG
GATGAAAATCATGGTAAGGAAATTGGAAGAAAACAAACTTAA
Protein sequenceShow/hide protein sequence
MSPTRTRWIAAFVSNTKRPHWEAAKWIVRYLKGTSNRGLLYTKSDSSTEAIVGFVDSDYAVDLDKRRSLTGYVFSVFGNIISWKSSLQSVVALSSTEVEYIALAECVKEA
VWLKGNVSEMLNAVAEVRIYCDNQSALSLSKNPFFHDRTKHIDVRFHYVREAVQEGSVQLSKIHTTHNPADIMTKTLAANRFEYLCDLLKLDHRVSKVILDHHSVRGSDE
DNLVEMGPGNDPEENQTNGLGQRGPTRNVGLGLGPRPTTRLACAAESVCLRSVPTASGRPGSAWHRRRCGLHHTGVQRFLLVLQVTSSPTSTNSLLVSREGQEFKRALNK
DENHGKEIGRKQT