; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041845 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041845
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr13:29328754..29333581
RNA-Seq ExpressionLag0041845
SyntenyLag0041845
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008483 - transaminase activity (molecular function)
GO:0030170 - pyridoxal phosphate binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR005814 - Aminotransferase class-III
IPR012337 - Ribonuclease H-like superfamily
IPR015421 - Pyridoxal phosphate-dependent transferase, major domain
IPR015424 - Pyridoxal phosphate-dependent transferase
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-8079.19Show/hide
Query:  MDQRFQDYMIEHGIQSQLSAPGTPQPNGVSERRNRTLLDMVRSMMSYAQLLSSFWGYAVETAVYILNNVPSKSVSETPFELWKG--------------PH
        MD RFQDYMIEHGIQSQLSAPGTPQ NGVSERRNRTLLDMVRSMMSYAQL SSFWGYAVETAV+ILNNVPSKSVSETPFELW+G               H
Subjt:  MDQRFQDYMIEHGIQSQLSAPGTPQPNGVSERRNRTLLDMVRSMMSYAQLLSSFWGYAVETAVYILNNVPSKSVSETPFELWKG--------------PH

Query:  VLVTNPKKLEPRSKLCQFVGYPKETRGGYFYDPQENKVIVSTNATFLEEDHLRDHKPRNKIILGEATEESPRVVDEAGPSTRIDEGTSSSDPSHPSQ
        VLVTNPKKLEPRS+LCQFVGYPKETRGG F+DPQEN+V VSTNATFLEEDH+R+HKPR+K++L EAT+ES RVVDE GPS+R+DE T++S  SHPSQ
Subjt:  VLVTNPKKLEPRSKLCQFVGYPKETRGGYFYDPQENKVIVSTNATFLEEDHLRDHKPRNKIILGEATEESPRVVDEAGPSTRIDEGTSSSDPSHPSQ

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]3.3e-7978.17Show/hide
Query:  MDQRFQDYMIEHGIQSQLSAPGTPQPNGVSERRNRTLLDMVRSMMSYAQLLSSFWGYAVETAVYILNNVPSKSVSETPFELWKG--------------PH
        MD  FQDYMIEHGIQSQLSAPGTPQ NGVSERRNRTLLDMVRSMMSYAQL SSFWGYAVETAV+ILNNVPSKSVSETPFELW+G               H
Subjt:  MDQRFQDYMIEHGIQSQLSAPGTPQPNGVSERRNRTLLDMVRSMMSYAQLLSSFWGYAVETAVYILNNVPSKSVSETPFELWKG--------------PH

Query:  VLVTNPKKLEPRSKLCQFVGYPKETRGGYFYDPQENKVIVSTNATFLEEDHLRDHKPRNKIILGEATEESPRVVDEAGPSTRIDEGTSSSDPSHPSQ
        VLVTNPKKLEPRS+LCQFVGYPKETRGG F+DP+EN+V VSTNATFLEEDH+R+HKPR+K++L EAT+ES RVVDE GPS+R+DE T++S  SHPSQ
Subjt:  VLVTNPKKLEPRSKLCQFVGYPKETRGGYFYDPQENKVIVSTNATFLEEDHLRDHKPRNKIILGEATEESPRVVDEAGPSTRIDEGTSSSDPSHPSQ

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-8079.19Show/hide
Query:  MDQRFQDYMIEHGIQSQLSAPGTPQPNGVSERRNRTLLDMVRSMMSYAQLLSSFWGYAVETAVYILNNVPSKSVSETPFELWKG--------------PH
        MD RFQDYMIEHGIQSQLSAPGTPQ NGVSERRNRTLLDMVRSMMSYAQL SSFWGYAVETAV+ILNNVPSKSVSETPFELW+G               H
Subjt:  MDQRFQDYMIEHGIQSQLSAPGTPQPNGVSERRNRTLLDMVRSMMSYAQLLSSFWGYAVETAVYILNNVPSKSVSETPFELWKG--------------PH

Query:  VLVTNPKKLEPRSKLCQFVGYPKETRGGYFYDPQENKVIVSTNATFLEEDHLRDHKPRNKIILGEATEESPRVVDEAGPSTRIDEGTSSSDPSHPSQ
        VLVTNPKKLEPRS+LCQFVGYPKETRGG F+DPQEN+V VSTNATFLEEDH+R+HKPR+K++L EAT+ES RVVDE GPS+R+DE T++S  SHPSQ
Subjt:  VLVTNPKKLEPRSKLCQFVGYPKETRGGYFYDPQENKVIVSTNATFLEEDHLRDHKPRNKIILGEATEESPRVVDEAGPSTRIDEGTSSSDPSHPSQ

TYK02840.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-8079.19Show/hide
Query:  MDQRFQDYMIEHGIQSQLSAPGTPQPNGVSERRNRTLLDMVRSMMSYAQLLSSFWGYAVETAVYILNNVPSKSVSETPFELWKG--------------PH
        MD RFQDYMIEHGIQSQLSAPGTPQ NGVSERRNRTLLDMVRSMMSYAQL SSFWGYAVETAV+ILNNVPSKSVSETPFELW+G               H
Subjt:  MDQRFQDYMIEHGIQSQLSAPGTPQPNGVSERRNRTLLDMVRSMMSYAQLLSSFWGYAVETAVYILNNVPSKSVSETPFELWKG--------------PH

Query:  VLVTNPKKLEPRSKLCQFVGYPKETRGGYFYDPQENKVIVSTNATFLEEDHLRDHKPRNKIILGEATEESPRVVDEAGPSTRIDEGTSSSDPSHPSQ
        VLVTNPKKLEPRS+LCQFVGYPKETRGG F+DPQEN+V VSTNATFLEEDH+R+HKPR+K++L EAT+ES RVVDE GPS+R+DE T++S  SHPSQ
Subjt:  VLVTNPKKLEPRSKLCQFVGYPKETRGGYFYDPQENKVIVSTNATFLEEDHLRDHKPRNKIILGEATEESPRVVDEAGPSTRIDEGTSSSDPSHPSQ

TYK15984.1 gag/pol protein [Cucumis melo var. makuwa]7.4e-7977.66Show/hide
Query:  MDQRFQDYMIEHGIQSQLSAPGTPQPNGVSERRNRTLLDMVRSMMSYAQLLSSFWGYAVETAVYILNNVPSKSVSETPFELWKG--------------PH
        MD RFQDYMIEHGIQSQLSAPGTPQ NGVSERRNRTLLDMV SMMSYAQL SSFWGYAVETAV+ILNNVPSKSVS+ PFELW+G               H
Subjt:  MDQRFQDYMIEHGIQSQLSAPGTPQPNGVSERRNRTLLDMVRSMMSYAQLLSSFWGYAVETAVYILNNVPSKSVSETPFELWKG--------------PH

Query:  VLVTNPKKLEPRSKLCQFVGYPKETRGGYFYDPQENKVIVSTNATFLEEDHLRDHKPRNKIILGEATEESPRVVDEAGPSTRIDEGTSSSDPSHPSQ
        VLVTNPKKLEPRS+LCQFVGYPKETRGG F+DPQEN+V VSTNATFLEEDH+R+HKPR+K++L EAT+ES RVVDE GPS+R+DE T++S  SHPSQ
Subjt:  VLVTNPKKLEPRSKLCQFVGYPKETRGGYFYDPQENKVIVSTNATFLEEDHLRDHKPRNKIILGEATEESPRVVDEAGPSTRIDEGTSSSDPSHPSQ

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein1.6e-7978.17Show/hide
Query:  MDQRFQDYMIEHGIQSQLSAPGTPQPNGVSERRNRTLLDMVRSMMSYAQLLSSFWGYAVETAVYILNNVPSKSVSETPFELWKG--------------PH
        MD  FQDYMIEHGIQSQLSAPGTPQ NGVSERRNRTLLDMVRSMMSYAQL SSFWGYAVETAV+ILNNVPSKSVSETPFELW+G               H
Subjt:  MDQRFQDYMIEHGIQSQLSAPGTPQPNGVSERRNRTLLDMVRSMMSYAQLLSSFWGYAVETAVYILNNVPSKSVSETPFELWKG--------------PH

Query:  VLVTNPKKLEPRSKLCQFVGYPKETRGGYFYDPQENKVIVSTNATFLEEDHLRDHKPRNKIILGEATEESPRVVDEAGPSTRIDEGTSSSDPSHPSQ
        VLVTNPKKLEPRS+LCQFVGYPKETRGG F+DP+EN+V VSTNATFLEEDH+R+HKPR+K++L EAT+ES RVVDE GPS+R+DE T++S  SHPSQ
Subjt:  VLVTNPKKLEPRSKLCQFVGYPKETRGGYFYDPQENKVIVSTNATFLEEDHLRDHKPRNKIILGEATEESPRVVDEAGPSTRIDEGTSSSDPSHPSQ

A0A5A7TZD0 Gag/pol protein6.5e-8179.19Show/hide
Query:  MDQRFQDYMIEHGIQSQLSAPGTPQPNGVSERRNRTLLDMVRSMMSYAQLLSSFWGYAVETAVYILNNVPSKSVSETPFELWKG--------------PH
        MD RFQDYMIEHGIQSQLSAPGTPQ NGVSERRNRTLLDMVRSMMSYAQL SSFWGYAVETAV+ILNNVPSKSVSETPFELW+G               H
Subjt:  MDQRFQDYMIEHGIQSQLSAPGTPQPNGVSERRNRTLLDMVRSMMSYAQLLSSFWGYAVETAVYILNNVPSKSVSETPFELWKG--------------PH

Query:  VLVTNPKKLEPRSKLCQFVGYPKETRGGYFYDPQENKVIVSTNATFLEEDHLRDHKPRNKIILGEATEESPRVVDEAGPSTRIDEGTSSSDPSHPSQ
        VLVTNPKKLEPRS+LCQFVGYPKETRGG F+DPQEN+V VSTNATFLEEDH+R+HKPR+K++L EAT+ES RVVDE GPS+R+DE T++S  SHPSQ
Subjt:  VLVTNPKKLEPRSKLCQFVGYPKETRGGYFYDPQENKVIVSTNATFLEEDHLRDHKPRNKIILGEATEESPRVVDEAGPSTRIDEGTSSSDPSHPSQ

A0A5A7UYE8 Gag/pol protein6.5e-8179.19Show/hide
Query:  MDQRFQDYMIEHGIQSQLSAPGTPQPNGVSERRNRTLLDMVRSMMSYAQLLSSFWGYAVETAVYILNNVPSKSVSETPFELWKG--------------PH
        MD RFQDYMIEHGIQSQLSAPGTPQ NGVSERRNRTLLDMVRSMMSYAQL SSFWGYAVETAV+ILNNVPSKSVSETPFELW+G               H
Subjt:  MDQRFQDYMIEHGIQSQLSAPGTPQPNGVSERRNRTLLDMVRSMMSYAQLLSSFWGYAVETAVYILNNVPSKSVSETPFELWKG--------------PH

Query:  VLVTNPKKLEPRSKLCQFVGYPKETRGGYFYDPQENKVIVSTNATFLEEDHLRDHKPRNKIILGEATEESPRVVDEAGPSTRIDEGTSSSDPSHPSQ
        VLVTNPKKLEPRS+LCQFVGYPKETRGG F+DPQEN+V VSTNATFLEEDH+R+HKPR+K++L EAT+ES RVVDE GPS+R+DE T++S  SHPSQ
Subjt:  VLVTNPKKLEPRSKLCQFVGYPKETRGGYFYDPQENKVIVSTNATFLEEDHLRDHKPRNKIILGEATEESPRVVDEAGPSTRIDEGTSSSDPSHPSQ

A0A5D3BUN8 Gag/pol protein6.5e-8179.19Show/hide
Query:  MDQRFQDYMIEHGIQSQLSAPGTPQPNGVSERRNRTLLDMVRSMMSYAQLLSSFWGYAVETAVYILNNVPSKSVSETPFELWKG--------------PH
        MD RFQDYMIEHGIQSQLSAPGTPQ NGVSERRNRTLLDMVRSMMSYAQL SSFWGYAVETAV+ILNNVPSKSVSETPFELW+G               H
Subjt:  MDQRFQDYMIEHGIQSQLSAPGTPQPNGVSERRNRTLLDMVRSMMSYAQLLSSFWGYAVETAVYILNNVPSKSVSETPFELWKG--------------PH

Query:  VLVTNPKKLEPRSKLCQFVGYPKETRGGYFYDPQENKVIVSTNATFLEEDHLRDHKPRNKIILGEATEESPRVVDEAGPSTRIDEGTSSSDPSHPSQ
        VLVTNPKKLEPRS+LCQFVGYPKETRGG F+DPQEN+V VSTNATFLEEDH+R+HKPR+K++L EAT+ES RVVDE GPS+R+DE T++S  SHPSQ
Subjt:  VLVTNPKKLEPRSKLCQFVGYPKETRGGYFYDPQENKVIVSTNATFLEEDHLRDHKPRNKIILGEATEESPRVVDEAGPSTRIDEGTSSSDPSHPSQ

A0A5D3CYF4 Gag/pol protein3.6e-7977.66Show/hide
Query:  MDQRFQDYMIEHGIQSQLSAPGTPQPNGVSERRNRTLLDMVRSMMSYAQLLSSFWGYAVETAVYILNNVPSKSVSETPFELWKG--------------PH
        MD RFQDYMIEHGIQSQLSAPGTPQ NGVSERRNRTLLDMV SMMSYAQL SSFWGYAVETAV+ILNNVPSKSVS+ PFELW+G               H
Subjt:  MDQRFQDYMIEHGIQSQLSAPGTPQPNGVSERRNRTLLDMVRSMMSYAQLLSSFWGYAVETAVYILNNVPSKSVSETPFELWKG--------------PH

Query:  VLVTNPKKLEPRSKLCQFVGYPKETRGGYFYDPQENKVIVSTNATFLEEDHLRDHKPRNKIILGEATEESPRVVDEAGPSTRIDEGTSSSDPSHPSQ
        VLVTNPKKLEPRS+LCQFVGYPKETRGG F+DPQEN+V VSTNATFLEEDH+R+HKPR+K++L EAT+ES RVVDE GPS+R+DE T++S  SHPSQ
Subjt:  VLVTNPKKLEPRSKLCQFVGYPKETRGGYFYDPQENKVIVSTNATFLEEDHLRDHKPRNKIILGEATEESPRVVDEAGPSTRIDEGTSSSDPSHPSQ

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.2e-1531.03Show/hide
Query:  MDQRFQDYMIEHGIQSQLSAPGTPQPNGVSERRNRTLLDMVRSMMSYAQLLSSFWGYAVETAVYILNNVPSKSV---SETPFELW--KGPHV--------
        +    + + ++ GI   L+ P TPQ NGVSER  RT+ +  R+M+S A+L  SFWG AV TA Y++N +PS+++   S+TP+E+W  K P++        
Subjt:  MDQRFQDYMIEHGIQSQLSAPGTPQPNGVSERRNRTLLDMVRSMMSYAQLLSSFWGYAVETAVYILNNVPSKSV---SETPFELW--KGPHV--------

Query:  -----LVTNPKKLEPRSKLCQFVGYPKETRGGYFYDPQENKVIVSTNATFLEEDHLRDHKPRNKIILGEATEES
             +     K + +S    FVGY  E  G   +D    K IV+ +    E + +     + + +  + ++ES
Subjt:  -----LVTNPKKLEPRSKLCQFVGYPKETRGGYFYDPQENKVIVSTNATFLEEDHLRDHKPRNKIILGEATEES

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.6e-2238.41Show/hide
Query:  QRFQDYMIEHGIQSQLSAPGTPQPNGVSERRNRTLLDMVRSMMSYAQLLSSFWGYAVETAVYILNNVPSKSVS-ETPFELWKGPHVLVTNPK--------
        + F++Y   HGI+ + + PGTPQ NGV+ER NRT+++ VRSM+  A+L  SFWG AV+TA Y++N  PS  ++ E P  +W    V  ++ K        
Subjt:  QRFQDYMIEHGIQSQLSAPGTPQPNGVSERRNRTLLDMVRSMMSYAQLLSSFWGYAVETAVYILNNVPSKSVS-ETPFELWKGPHVLVTNPK--------

Query:  --------KLEPRSKLCQFVGYPKETRGGYFYDPQENKVIVSTNATFLEED
                KL+ +S  C F+GY  E  G   +DP + KVI S +  F E +
Subjt:  --------KLEPRSKLCQFVGYPKETRGGYFYDPQENKVIVSTNATFLEED

Q940M2 Alanine--glyoxylate aminotransferase 2 homolog 1, mitochondrial9.2e-2480Show/hide
Query:  HCHPNVLAAVNEQNKLLQHATTIYLHHAIADFAEALAAKMPGNLNVVYFVNSGTYANELAMLMA-VYTVS
        HCHP++L A+ EQ+KLLQHATTIYLHHAI DFAEALAAKMPGNL VVYFVNSG+ ANELAM+MA +YT S
Subjt:  HCHPNVLAAVNEQNKLLQHATTIYLHHAIADFAEALAAKMPGNLNVVYFVNSGTYANELAMLMA-VYTVS

Q94AL9 Alanine--glyoxylate aminotransferase 2 homolog 2, mitochondrial4.9e-1757.89Show/hide
Query:  HCHPNVLAAVNEQNKLLQHATTIYLHHAIADFAEALAAKMPGNLNVVYFVNSGTYANELAMLMAVYTVSASTFVAI
        HCHP+V+  V  Q K LQH T +YL+HAIADF+EALA+K+PG+L VV+F NSGT ANELA++MA         VA+
Subjt:  HCHPNVLAAVNEQNKLLQHATTIYLHHAIADFAEALAAKMPGNLNVVYFVNSGTYANELAMLMAVYTVSASTFVAI

Q9SR86 Alanine--glyoxylate aminotransferase 2 homolog 3, mitochondrial7.3e-1337.84Show/hide
Query:  FGFAGSPNDGNSCKPSSSSSSSQQTSYESSSSYSF--SPVSLL---LLILVSLDASKSIDATRKGSLARFINHSCYCILWHCHPNVLAAVNEQNKLLQHA
        F ++  P DG    PS++   +++  + S + + F  +P++++   +  +   +  + +DA   G +A     SC     HCHP V+ +V +Q KL+ H+
Subjt:  FGFAGSPNDGNSCKPSSSSSSSQQTSYESSSSYSF--SPVSLL---LLILVSLDASKSIDATRKGSLARFINHSCYCILWHCHPNVLAAVNEQNKLLQHA

Query:  TTIYLHHAIADFAEALAAKMPGNLNVVYFVNSGTYANELAMLMA-VYT
        T +YL+H I+DFAEAL + +PG+L VV+F NSGT ANELAM+MA +YT
Subjt:  TTIYLHHAIADFAEALAAKMPGNLNVVYFVNSGTYANELAMLMA-VYT

Arabidopsis top hitse value%identityAlignment
AT1G76710.1 SET domain group 265.7e-0576.92Show/hide
Query:  LVSLDASKSIDATRKGSLARFINHSC
        ++SL+AS++IDAT+KGSLARFINHSC
Subjt:  LVSLDASKSIDATRKGSLARFINHSC

AT2G38400.1 alanine:glyoxylate aminotransferase 33.5e-1857.89Show/hide
Query:  HCHPNVLAAVNEQNKLLQHATTIYLHHAIADFAEALAAKMPGNLNVVYFVNSGTYANELAMLMAVYTVSASTFVAI
        HCHP+V+  V  Q K LQH T +YL+HAIADF+EALA+K+PG+L VV+F NSGT ANELA++MA         VA+
Subjt:  HCHPNVLAAVNEQNKLLQHATTIYLHHAIADFAEALAAKMPGNLNVVYFVNSGTYANELAMLMAVYTVSASTFVAI

AT2G38400.2 alanine:glyoxylate aminotransferase 33.5e-1857.89Show/hide
Query:  HCHPNVLAAVNEQNKLLQHATTIYLHHAIADFAEALAAKMPGNLNVVYFVNSGTYANELAMLMAVYTVSASTFVAI
        HCHP+V+  V  Q K LQH T +YL+HAIADF+EALA+K+PG+L VV+F NSGT ANELA++MA         VA+
Subjt:  HCHPNVLAAVNEQNKLLQHATTIYLHHAIADFAEALAAKMPGNLNVVYFVNSGTYANELAMLMAVYTVSASTFVAI

AT3G08860.1 PYRIMIDINE 45.2e-1437.84Show/hide
Query:  FGFAGSPNDGNSCKPSSSSSSSQQTSYESSSSYSF--SPVSLL---LLILVSLDASKSIDATRKGSLARFINHSCYCILWHCHPNVLAAVNEQNKLLQHA
        F ++  P DG    PS++   +++  + S + + F  +P++++   +  +   +  + +DA   G +A     SC     HCHP V+ +V +Q KL+ H+
Subjt:  FGFAGSPNDGNSCKPSSSSSSSQQTSYESSSSYSF--SPVSLL---LLILVSLDASKSIDATRKGSLARFINHSCYCILWHCHPNVLAAVNEQNKLLQHA

Query:  TTIYLHHAIADFAEALAAKMPGNLNVVYFVNSGTYANELAMLMA-VYT
        T +YL+H I+DFAEAL + +PG+L VV+F NSGT ANELAM+MA +YT
Subjt:  TTIYLHHAIADFAEALAAKMPGNLNVVYFVNSGTYANELAMLMA-VYT

AT4G39660.1 alanine:glyoxylate aminotransferase 26.5e-2580Show/hide
Query:  HCHPNVLAAVNEQNKLLQHATTIYLHHAIADFAEALAAKMPGNLNVVYFVNSGTYANELAMLMA-VYTVS
        HCHP++L A+ EQ+KLLQHATTIYLHHAI DFAEALAAKMPGNL VVYFVNSG+ ANELAM+MA +YT S
Subjt:  HCHPNVLAAVNEQNKLLQHATTIYLHHAIADFAEALAAKMPGNLNVVYFVNSGTYANELAMLMA-VYTVS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCAAAGATTCCAGGATTATATGATAGAACATGGAATCCAATCCCAACTCTCAGCCCCTGGTACACCTCAACCAAATGGTGTATCAGAGAGGAGAAATAGAACCTT
GTTAGACATGGTTCGATCAATGATGAGCTACGCTCAATTGCTTAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAGTATACATCTTGAACAATGTTCCCTCGAAAAGTG
TTTCTGAAACACCTTTCGAATTATGGAAGGGGCCACACGTGCTTGTGACAAATCCTAAGAAATTGGAACCTCGTTCTAAACTGTGCCAATTTGTTGGTTACCCTAAAGAA
ACAAGAGGTGGTTATTTCTACGACCCACAAGAAAATAAGGTGATTGTATCGACAAACGCCACCTTCTTAGAGGAAGACCACCTGAGAGATCATAAACCACGCAACAAAAT
AATATTAGGAGAAGCTACTGAAGAATCACCAAGAGTTGTTGATGAAGCTGGACCTTCAACAAGGATTGATGAAGGAACTAGTTCTTCAGATCCATCCCATCCATCTCAAT
TTTGTACCCTCACCGACGACGCAAGTTCTTCCTCTCACCGGCGGTGCAAGTTCGTCCTCTACCATCTCACGTTGTTGCCGGTAGGGGTTTTTTGTGGGGTCGTCGTCGTT
AGTGTGGGTCATCGCCATGGGTCATCCTCGCCGTGGGGAGTCGTCGTGGGTAGTAGATTTGAGCGAGCGAAGGTCGTGGGCAGTAGATCTGAGGTCATGGAGGCGTGGGT
TGTTGTCGTAGGAATATATTCAAAGCTTATGGTCGAATTGTTCGAATCTGATCGGTTCTCCGTAGATCTGAGATCTGAGTCGTATCTGCTTCGATTTTTCAGATATGGGT
ACTTCTATTCATTCGGATTCGCTGGCTCGCCAAATGACGGTAACTCTTGCAAGCCCTCGTCCTCTTCTTCGTCCTCGCAGCAGACTAGTTACGAGTCATCTTCGTCTTAT
TCTTTTTCTCCTGTTTCTCTATTGCTTCTCATACTTGTGTCGCTTGATGCTTCTAAATCAATTGATGCCACTAGAAAGGGAAGTCTTGCTAGATTTATAAATCATTCATG
TTACTGTATCTTGTGGCATTGCCATCCAAATGTTTTGGCTGCAGTCAATGAGCAAAACAAACTTCTACAGCATGCTACAACCATATACCTACACCATGCAATAGCTGATT
TTGCTGAAGCATTGGCGGCAAAAATGCCTGGAAACTTGAACGTTGTATATTTTGTAAACTCTGGGACATATGCAAATGAATTAGCCATGCTTATGGCCGTCTATACAGTG
TCGGCATCTACATTTGTTGCAATTGGCAGGGCAGTTCAATTCAAGTATTTTGATTCATCTGCAAAATTCTTGTGGCACCTTGCACGTGTTTTAATGTCCATGGTCGTCTC
CGAGCGTCACAAGCCACTCGTTGATGTGAAACTCTTCAGGTTGTGTCCGATTTGGCAATTGAGGAATACTTCGTCTTCTTCGTCGTCAACTCATAACCTTCACCATAGTG
GTCATGTCCCTATCGAAAGAATCTCAACGACGTCAAATAAATATCAAATTGTCATCGTCTTCCAAGATAGTGTCCTCTATGGCGAAGTCATTGTTGCCTCCTCGACGTCA
CCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATCAAAGATTCCAGGATTATATGATAGAACATGGAATCCAATCCCAACTCTCAGCCCCTGGTACACCTCAACCAAATGGTGTATCAGAGAGGAGAAATAGAACCTT
GTTAGACATGGTTCGATCAATGATGAGCTACGCTCAATTGCTTAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAGTATACATCTTGAACAATGTTCCCTCGAAAAGTG
TTTCTGAAACACCTTTCGAATTATGGAAGGGGCCACACGTGCTTGTGACAAATCCTAAGAAATTGGAACCTCGTTCTAAACTGTGCCAATTTGTTGGTTACCCTAAAGAA
ACAAGAGGTGGTTATTTCTACGACCCACAAGAAAATAAGGTGATTGTATCGACAAACGCCACCTTCTTAGAGGAAGACCACCTGAGAGATCATAAACCACGCAACAAAAT
AATATTAGGAGAAGCTACTGAAGAATCACCAAGAGTTGTTGATGAAGCTGGACCTTCAACAAGGATTGATGAAGGAACTAGTTCTTCAGATCCATCCCATCCATCTCAAT
TTTGTACCCTCACCGACGACGCAAGTTCTTCCTCTCACCGGCGGTGCAAGTTCGTCCTCTACCATCTCACGTTGTTGCCGGTAGGGGTTTTTTGTGGGGTCGTCGTCGTT
AGTGTGGGTCATCGCCATGGGTCATCCTCGCCGTGGGGAGTCGTCGTGGGTAGTAGATTTGAGCGAGCGAAGGTCGTGGGCAGTAGATCTGAGGTCATGGAGGCGTGGGT
TGTTGTCGTAGGAATATATTCAAAGCTTATGGTCGAATTGTTCGAATCTGATCGGTTCTCCGTAGATCTGAGATCTGAGTCGTATCTGCTTCGATTTTTCAGATATGGGT
ACTTCTATTCATTCGGATTCGCTGGCTCGCCAAATGACGGTAACTCTTGCAAGCCCTCGTCCTCTTCTTCGTCCTCGCAGCAGACTAGTTACGAGTCATCTTCGTCTTAT
TCTTTTTCTCCTGTTTCTCTATTGCTTCTCATACTTGTGTCGCTTGATGCTTCTAAATCAATTGATGCCACTAGAAAGGGAAGTCTTGCTAGATTTATAAATCATTCATG
TTACTGTATCTTGTGGCATTGCCATCCAAATGTTTTGGCTGCAGTCAATGAGCAAAACAAACTTCTACAGCATGCTACAACCATATACCTACACCATGCAATAGCTGATT
TTGCTGAAGCATTGGCGGCAAAAATGCCTGGAAACTTGAACGTTGTATATTTTGTAAACTCTGGGACATATGCAAATGAATTAGCCATGCTTATGGCCGTCTATACAGTG
TCGGCATCTACATTTGTTGCAATTGGCAGGGCAGTTCAATTCAAGTATTTTGATTCATCTGCAAAATTCTTGTGGCACCTTGCACGTGTTTTAATGTCCATGGTCGTCTC
CGAGCGTCACAAGCCACTCGTTGATGTGAAACTCTTCAGGTTGTGTCCGATTTGGCAATTGAGGAATACTTCGTCTTCTTCGTCGTCAACTCATAACCTTCACCATAGTG
GTCATGTCCCTATCGAAAGAATCTCAACGACGTCAAATAAATATCAAATTGTCATCGTCTTCCAAGATAGTGTCCTCTATGGCGAAGTCATTGTTGCCTCCTCGACGTCA
CCTTGA
Protein sequenceShow/hide protein sequence
MDQRFQDYMIEHGIQSQLSAPGTPQPNGVSERRNRTLLDMVRSMMSYAQLLSSFWGYAVETAVYILNNVPSKSVSETPFELWKGPHVLVTNPKKLEPRSKLCQFVGYPKE
TRGGYFYDPQENKVIVSTNATFLEEDHLRDHKPRNKIILGEATEESPRVVDEAGPSTRIDEGTSSSDPSHPSQFCTLTDDASSSSHRRCKFVLYHLTLLPVGVFCGVVVV
SVGHRHGSSSPWGVVVGSRFERAKVVGSRSEVMEAWVVVVGIYSKLMVELFESDRFSVDLRSESYLLRFFRYGYFYSFGFAGSPNDGNSCKPSSSSSSSQQTSYESSSSY
SFSPVSLLLLILVSLDASKSIDATRKGSLARFINHSCYCILWHCHPNVLAAVNEQNKLLQHATTIYLHHAIADFAEALAAKMPGNLNVVYFVNSGTYANELAMLMAVYTV
SASTFVAIGRAVQFKYFDSSAKFLWHLARVLMSMVVSERHKPLVDVKLFRLCPIWQLRNTSSSSSSTHNLHHSGHVPIERISTTSNKYQIVIVFQDSVLYGEVIVASSTS
P