; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036932 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036932
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase-type DNA-binding superfamily protein
Genome locationchr2:2222836..2227193
RNA-Seq ExpressionLag0036932
SyntenyLag0036932
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0090304 - nucleic acid metabolic process (biological process)
GO:0016874 - ligase activity (molecular function)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN72476.1 hypothetical protein VITISV_022512 [Vitis vinifera]6.3e-3832.44Show/hide
Query:  SSETVGDCLANGKRGILLKLDYEKAYDKVDWDYLDFILKKKGFGNRWRRWVRGCLTTTNFSIFLNGRSQGKIVAKRGLRQGDPLSPFLFTLMVDSFSRLI
        ++E V +   +G+ G++ K+D+EKAYD + WD+LD +L+KKGF  +WR+W+ GCL++ ++++ +NG ++G + A RGLRQGDPLSPFLFTL+ D  SR++
Subjt:  SSETVGDCLANGKRGILLKLDYEKAYDKVDWDYLDFILKKKGFGNRWRRWVRGCLTTTNFSIFLNGRSQGKIVAKRGLRQGDPLSPFLFTLMVDSFSRLI

Query:  HFCCEKRVLR---------------------------------------------GLSLNVSKTSIIGLNVSKEWLSDLAARFGCKIESLSFKCLGFPLG
            ++ +L                                              GL +N+ K++I  +N+ +  +S LA    CK        LG PLG
Subjt:  HFCCEKRVLR---------------------------------------------GLSLNVSKTSIIGLNVSKEWLSDLAARFGCKIESLSFKCLGFPLG

Query:  G----------------NHRRGLFDWEIPSWVGLIEILDTISLGGG-RDVCLWSLEKSGIFSSKSAFWKLSDSSPSLKTPLIDAIWKNHAPKKVKVFLW
                         N RR L D EI     L+  LD + L     D   W L  SG+FS KS F  LS  S S +      +W +  P KV+ F+W
Subjt:  G----------------NHRRGLFDWEIPSWVGLIEILDTISLGGG-RDVCLWSLEKSGIFSSKSAFWKLSDSSPSLKTPLIDAIWKNHAPKKVKVFLW

CAN77848.1 hypothetical protein VITISV_020832 [Vitis vinifera]2.1e-4135.66Show/hide
Query:  SSETVGDCLANGKRGILLKLDYEKAYDKVDWDYLDFILKKKGFGNRWRRWVRGCLTTTNFSIFLNGRSQGKIVAKRGLRQGDPLSPFLFTLMVDSFSRLI
        ++E V +   +G+ G++ K+D+EKAYD V WD+LD +L+KKGF  +WR+W+ GCL++ ++++ +NG ++G + A RGLRQGDPLSPFLFTL+ D  SR++
Subjt:  SSETVGDCLANGKRGILLKLDYEKAYDKVDWDYLDFILKKKGFGNRWRRWVRGCLTTTNFSIFLNGRSQGKIVAKRGLRQGDPLSPFLFTLMVDSFSRLI

Query:  HFCCEKRVLR----------------GLSLNVSKTSIIGLNVSKEWLSDLAARFGCKIESLSFKCLGFPLGG----------------NHRRGLFDWEIP
            ++ +L                 GL +++ K++I G+N+ +  +S L     CK        LG PLGG                N RR L D+EI 
Subjt:  HFCCEKRVLR----------------GLSLNVSKTSIIGLNVSKEWLSDLAARFGCKIESLSFKCLGFPLGG----------------NHRRGLFDWEIP

Query:  SWVGLIEILDTISLGGG-RDVCLWSLEKSGIFSSKSAFWKLSDSSPSLKTPLIDAIWKNHAPKKVKVFLWSG
           GL+  LD + L     D   W L   G+FS K  F  LS  S S +      +W +  P KV+ F+  G
Subjt:  SWVGLIEILDTISLGGG-RDVCLWSLEKSGIFSSKSAFWKLSDSSPSLKTPLIDAIWKNHAPKKVKVFLWSG

CAN78744.1 hypothetical protein VITISV_014186 [Vitis vinifera]2.0e-3630.29Show/hide
Query:  SSETVGDCLANGKRGILLKLDYEKAYDKVDWDYLDFILKKKGFGNRWRRWVRGCLTTTNFSIFLNGRSQGKIVAKRGLRQGDPLSPFLFTLMVDSFSRLI
        ++E V +   +GK G++ K+D+EKAYD V WD+LD +L+KKGF  RWR+W+ GCL++ +++I +NG ++G + A RGLRQGDPLSPFLFTL+ D  SR++
Subjt:  SSETVGDCLANGKRGILLKLDYEKAYDKVDWDYLDFILKKKGFGNRWRRWVRGCLTTTNFSIFLNGRSQGKIVAKRGLRQGDPLSPFLFTLMVDSFSRLI

Query:  HFCCEKRVLR---------------------------------------------GLSLNVSKTSIIGLNVSKEWLSDLAARFGCKIESLSFKCLGFPLG
            E+ ++                                               L +N++K+SI  +N+ +  LS LA    CK+       LG PLG
Subjt:  HFCCEKRVLR---------------------------------------------GLSLNVSKTSIIGLNVSKEWLSDLAARFGCKIESLSFKCLGFPLG

Query:  G-------------------------------------------------------------------NHRRGLFDWEIPSWVGLIEILDTISLGGG-RD
                                                                            N RR L D EI    GL+  LD + L     D
Subjt:  G-------------------------------------------------------------------NHRRGLFDWEIPSWVGLIEILDTISLGGG-RD

Query:  VCLWSLEKSGIFSSKSAFWKLSDSSPSLKTPLIDAIWKNHAPKKVKVFLW
          LWSL   G+FS KS F  LS SS S +      +W +  P KVK F+W
Subjt:  VCLWSLEKSGIFSSKSAFWKLSDSSPSLKTPLIDAIWKNHAPKKVKVFLW

RVW62473.1 putative mitochondrial protein [Vitis vinifera]3.1e-3745.22Show/hide
Query:  SSETVGDCLANGKRGILLKLDYEKAYDKVDWDYLDFILKKKGFGNRWRRWVRGCLTTTNFSIFLNGRSQGKIVAKRGLRQGDPLSPFLFTLMVDSFSRLI
        ++E V +   +G+ G++ K+D+EKAYD V WD+LD +L+KKGF ++WR W+RGCL++ +++I +NG ++G + A RGLRQGDPLSPFLFT++ D  SR++
Subjt:  SSETVGDCLANGKRGILLKLDYEKAYDKVDWDYLDFILKKKGFGNRWRRWVRGCLTTTNFSIFLNGRSQGKIVAKRGLRQGDPLSPFLFTLMVDSFSRLI

Query:  HFCCEKRVLRGLSLNVSKTSIIGLNVSKEWLSDLAARFGCKIESLSFKCLGFPLGGN
            E+ +L G  +N+ K+++ G+N+ +  LS LA    CK        LG PLGGN
Subjt:  HFCCEKRVLRGLSLNVSKTSIIGLNVSKEWLSDLAARFGCKIESLSFKCLGFPLGGN

XP_022158566.1 uncharacterized protein LOC111025019 [Momordica charantia]4.8e-3842.19Show/hide
Query:  VDWDYLDFILKKKGFGNRWRRWVRGCLTTTNFSIFLNGRSQGKIVAKRGLRQGDPLSPFLFTLMVDSFSRLIHFCCEKRVLR------------------
        VDW +LD +L +KGFG RWR W++GC+++ NFS+F+NGR +GKI A RGLRQGDPLSPFLFTL+ DSFSRL++FCCE+R+++                  
Subjt:  VDWDYLDFILKKKGFGNRWRRWVRGCLTTTNFSIFLNGRSQGKIVAKRGLRQGDPLSPFLFTLMVDSFSRLIHFCCEKRVLR------------------

Query:  ---------------------------GLSLNVSKTSIIGLNVSKEWLSDLAARFGCKIESLSFKCLGFPLGGNHRRGLFDWEIPSWVGLIE
                                   GL +N++KTS+IG+N+ +   +    + GC++E +SF+ LG PLGGNHR  +F      W  L+E
Subjt:  ---------------------------GLSLNVSKTSIIGLNVSKEWLSDLAARFGCKIESLSFKCLGFPLGGNHRRGLFDWEIPSWVGLIE

TrEMBL top hitse value%identityAlignment
A0A6J1E196 uncharacterized protein LOC1110250192.3e-3842.19Show/hide
Query:  VDWDYLDFILKKKGFGNRWRRWVRGCLTTTNFSIFLNGRSQGKIVAKRGLRQGDPLSPFLFTLMVDSFSRLIHFCCEKRVLR------------------
        VDW +LD +L +KGFG RWR W++GC+++ NFS+F+NGR +GKI A RGLRQGDPLSPFLFTL+ DSFSRL++FCCE+R+++                  
Subjt:  VDWDYLDFILKKKGFGNRWRRWVRGCLTTTNFSIFLNGRSQGKIVAKRGLRQGDPLSPFLFTLMVDSFSRLIHFCCEKRVLR------------------

Query:  ---------------------------GLSLNVSKTSIIGLNVSKEWLSDLAARFGCKIESLSFKCLGFPLGGNHRRGLFDWEIPSWVGLIE
                                   GL +N++KTS+IG+N+ +   +    + GC++E +SF+ LG PLGGNHR  +F      W  L+E
Subjt:  ---------------------------GLSLNVSKTSIIGLNVSKEWLSDLAARFGCKIESLSFKCLGFPLGGNHRRGLFDWEIPSWVGLIE

A0A803PZR9 Uncharacterized protein1.9e-4043.81Show/hide
Query:  SSETVGDCLANGKRGILLKLDYEKAYDKVDWDYLDFILKKKGFGNRWRRWVRGCLTTTNFSIFLNGRSQGKIVAKRGLRQGDPLSPFLFTLMVDSFSRLI
        ++ETV D  + GK G++ K+D+EKAYD+V+WD+LD ++ KKGFG+ WR+W+RGCL+TT+FSIF+NGR +GK  A RGLRQGDPLSPFLFTL+ D   R++
Subjt:  SSETVGDCLANGKRGILLKLDYEKAYDKVDWDYLDFILKKKGFGNRWRRWVRGCLTTTNFSIFLNGRSQGKIVAKRGLRQGDPLSPFLFTLMVDSFSRLI

Query:  H----------------------------FCCEKRVLRGLSLNVSKTSIIGLNVSKEWLSDLAARFGCKIESLSFKCLGFPLGGNHRRGLFDWE
                                     FC     L GL +N+SK+ ++GL + +  +S  AA+ GC++     K LG PL G+ R+ +F W+
Subjt:  H----------------------------FCCEKRVLRGLSLNVSKTSIIGLNVSKEWLSDLAARFGCKIESLSFKCLGFPLGGNHRRGLFDWE

A0A803QI00 Uncharacterized protein1.2e-3739.52Show/hide
Query:  SSETVGDCLANGKRGILLKLDYEKAYDKVDWDYLDFILKKKGFGNRWRRWVRGCLTTTNFSIFLNGRSQGKIVAKRGLRQGDPLSPFLFTLMVDSFSRLI
        ++ETV D  + GK+G + K+D EKAYD+VDWD+LD +LK+KGFG  WR+W+RGC+++T+FS+ +NGR +GK    RGLRQGDPLSPFLFTL+VD   RL+
Subjt:  SSETVGDCLANGKRGILLKLDYEKAYDKVDWDYLDFILKKKGFGNRWRRWVRGCLTTTNFSIFLNGRSQGKIVAKRGLRQGDPLSPFLFTLMVDSFSRLI

Query:  -------------------------------HFCCEKRVLR-------------GLSLNVSKTSIIGLNVSKEWLSDLAARFGCKIESLSFKCLGFPLGG
                                        F  ++  LR             GL +N++K+ ++G+++ +E ++  A   GC++ +     LG PLGG
Subjt:  -------------------------------HFCCEKRVLR-------------GLSLNVSKTSIIGLNVSKEWLSDLAARFGCKIESLSFKCLGFPLGG

Query:  NHRRGLFDWE
        + R+G F WE
Subjt:  NHRRGLFDWE

A5AE29 Reverse transcriptase domain-containing protein7.8e-4235.66Show/hide
Query:  SSETVGDCLANGKRGILLKLDYEKAYDKVDWDYLDFILKKKGFGNRWRRWVRGCLTTTNFSIFLNGRSQGKIVAKRGLRQGDPLSPFLFTLMVDSFSRLI
        ++E V +   +G+ G++ K+D+EKAYD V WD+LD +L+KKGF  +WR+W+ GCL++ ++++ +NG ++G + A RGLRQGDPLSPFLFTL+ D  SR++
Subjt:  SSETVGDCLANGKRGILLKLDYEKAYDKVDWDYLDFILKKKGFGNRWRRWVRGCLTTTNFSIFLNGRSQGKIVAKRGLRQGDPLSPFLFTLMVDSFSRLI

Query:  HFCCEKRVLR----------------GLSLNVSKTSIIGLNVSKEWLSDLAARFGCKIESLSFKCLGFPLGG----------------NHRRGLFDWEIP
            ++ +L                 GL +++ K++I G+N+ +  +S L     CK        LG PLGG                N RR L D+EI 
Subjt:  HFCCEKRVLR----------------GLSLNVSKTSIIGLNVSKEWLSDLAARFGCKIESLSFKCLGFPLGG----------------NHRRGLFDWEIP

Query:  SWVGLIEILDTISLGGG-RDVCLWSLEKSGIFSSKSAFWKLSDSSPSLKTPLIDAIWKNHAPKKVKVFLWSG
           GL+  LD + L     D   W L   G+FS K  F  LS  S S +      +W +  P KV+ F+  G
Subjt:  SWVGLIEILDTISLGGG-RDVCLWSLEKSGIFSSKSAFWKLSDSSPSLKTPLIDAIWKNHAPKKVKVFLWSG

A5BM49 Reverse transcriptase domain-containing protein3.0e-3832.44Show/hide
Query:  SSETVGDCLANGKRGILLKLDYEKAYDKVDWDYLDFILKKKGFGNRWRRWVRGCLTTTNFSIFLNGRSQGKIVAKRGLRQGDPLSPFLFTLMVDSFSRLI
        ++E V +   +G+ G++ K+D+EKAYD + WD+LD +L+KKGF  +WR+W+ GCL++ ++++ +NG ++G + A RGLRQGDPLSPFLFTL+ D  SR++
Subjt:  SSETVGDCLANGKRGILLKLDYEKAYDKVDWDYLDFILKKKGFGNRWRRWVRGCLTTTNFSIFLNGRSQGKIVAKRGLRQGDPLSPFLFTLMVDSFSRLI

Query:  HFCCEKRVLR---------------------------------------------GLSLNVSKTSIIGLNVSKEWLSDLAARFGCKIESLSFKCLGFPLG
            ++ +L                                              GL +N+ K++I  +N+ +  +S LA    CK        LG PLG
Subjt:  HFCCEKRVLR---------------------------------------------GLSLNVSKTSIIGLNVSKEWLSDLAARFGCKIESLSFKCLGFPLG

Query:  G----------------NHRRGLFDWEIPSWVGLIEILDTISLGGG-RDVCLWSLEKSGIFSSKSAFWKLSDSSPSLKTPLIDAIWKNHAPKKVKVFLW
                         N RR L D EI     L+  LD + L     D   W L  SG+FS KS F  LS  S S +      +W +  P KV+ F+W
Subjt:  G----------------NHRRGLFDWEIPSWVGLIEILDTISLGGG-RDVCLWSLEKSGIFSSKSAFWKLSDSSPSLKTPLIDAIWKNHAPKKVKVFLW

SwissProt top hitse value%identityAlignment
Q39127 Ethylene-responsive transcription factor TINY1.6e-1274Show/hide
Query:  MAARAHDVATFSIKGSSATLNFPDLLHLLPRPVSLAPRDVQAAAAKAAHM
        MAARAHDVA  SIKG+SA LNFPDL    PRP SL+PRD+Q AA KAAHM
Subjt:  MAARAHDVATFSIKGSSATLNFPDLLHLLPRPVSLAPRDVQAAAAKAAHM

Q52QU1 Ethylene-responsive transcription factor ERF0422.1e-1260.87Show/hide
Query:  MAARAHDVATFSIKGSSATLNFPDLLHLLPRPVSLAPRDVQAAAAKAAHMHNLDLDDHHSHVDDDLTEI
        MAARAHDVA  SIKGSSA LNFP+L   LPRPVSL+ +D+QAAAA+AA M   D      H+ DD T +
Subjt:  MAARAHDVATFSIKGSSATLNFPDLLHLLPRPVSLAPRDVQAAAAKAAHMHNLDLDDHHSHVDDDLTEI

Q9LYD3 Dehydration-responsive element-binding protein 31.6e-1245.1Show/hide
Query:  MAARAHDVATFSIKGSSATLNFPDLLHLLPRPVSLAPRDVQAAAAKAAHMHN--------------------------LDLDDHHSHVDDDLTEIIQLPT
        MAARAHDVA  SIKG++A LNFP+L    PRPVSL+PRD+Q AA KAAHM                            +DL    S   ++L EI++LP+
Subjt:  MAARAHDVATFSIKGSSATLNFPDLLHLLPRPVSLAPRDVQAAAAKAAHMHN--------------------------LDLDDHHSHVDDDLTEIIQLPT

Query:  LG
        LG
Subjt:  LG

Q9M080 Ethylene-responsive transcription factor ERF0434.7e-1261.76Show/hide
Query:  MAARAHDVATFSIKGSSATLNFPDLLHLLPRPVSLAPRDVQAAAAKAAHMHNLDLDDHHSHVDDDLTE
        MA RAHDVA  SIKG+SA LNFP+L  LLPRPVSL+PRDV+AAA KAA M   D D      D + +E
Subjt:  MAARAHDVATFSIKGSSATLNFPDLLHLLPRPVSLAPRDVQAAAAKAAHMHNLDLDDHHSHVDDDLTE

Q9M210 Ethylene-responsive transcription factor ERF0354.0e-1137.21Show/hide
Query:  MAARAHDVATFSIKGSSATLNFPDLLHLLPRPVSLAPRDVQAAAAKAAHMHNLDLDDHHSHVDDDLTEIIQLPTLGLRSIHRKKCSDLFSVLNHSHGQPS
        MAARAHDVA  +IKG+S  LNFP+L  LLPRPVS +P+D+QAAA KAA             + D+L+    L T    +      S   S  + +  + +
Subjt:  MAARAHDVATFSIKGSSATLNFPDLLHLLPRPVSLAPRDVQAAAAKAAHMHNLDLDDHHSHVDDDLTEIIQLPTLGLRSIHRKKCSDLFSVLNHSHGQPS

Query:  NRTTFDYQGYIIFMKTTSSETVGDCLANG
          T FD     +F     +     CL NG
Subjt:  NRTTFDYQGYIIFMKTTSSETVGDCLANG

Arabidopsis top hitse value%identityAlignment
AT2G25820.1 Integrase-type DNA-binding superfamily protein1.5e-1360.87Show/hide
Query:  MAARAHDVATFSIKGSSATLNFPDLLHLLPRPVSLAPRDVQAAAAKAAHMHNLDLDDHHSHVDDDLTEI
        MAARAHDVA  SIKGSSA LNFP+L   LPRPVSL+ +D+QAAAA+AA M   D      H+ DD T +
Subjt:  MAARAHDVATFSIKGSSATLNFPDLLHLLPRPVSLAPRDVQAAAAKAAHMHNLDLDDHHSHVDDDLTEI

AT3G60490.1 Integrase-type DNA-binding superfamily protein2.8e-1237.21Show/hide
Query:  MAARAHDVATFSIKGSSATLNFPDLLHLLPRPVSLAPRDVQAAAAKAAHMHNLDLDDHHSHVDDDLTEIIQLPTLGLRSIHRKKCSDLFSVLNHSHGQPS
        MAARAHDVA  +IKG+S  LNFP+L  LLPRPVS +P+D+QAAA KAA             + D+L+    L T    +      S   S  + +  + +
Subjt:  MAARAHDVATFSIKGSSATLNFPDLLHLLPRPVSLAPRDVQAAAAKAAHMHNLDLDDHHSHVDDDLTEIIQLPTLGLRSIHRKKCSDLFSVLNHSHGQPS

Query:  NRTTFDYQGYIIFMKTTSSETVGDCLANG
          T FD     +F     +     CL NG
Subjt:  NRTTFDYQGYIIFMKTTSSETVGDCLANG

AT4G32800.1 Integrase-type DNA-binding superfamily protein3.4e-1361.76Show/hide
Query:  MAARAHDVATFSIKGSSATLNFPDLLHLLPRPVSLAPRDVQAAAAKAAHMHNLDLDDHHSHVDDDLTE
        MA RAHDVA  SIKG+SA LNFP+L  LLPRPVSL+PRDV+AAA KAA M   D D      D + +E
Subjt:  MAARAHDVATFSIKGSSATLNFPDLLHLLPRPVSLAPRDVQAAAAKAAHMHNLDLDDHHSHVDDDLTE

AT5G11590.1 Integrase-type DNA-binding superfamily protein1.2e-1345.1Show/hide
Query:  MAARAHDVATFSIKGSSATLNFPDLLHLLPRPVSLAPRDVQAAAAKAAHMHN--------------------------LDLDDHHSHVDDDLTEIIQLPT
        MAARAHDVA  SIKG++A LNFP+L    PRPVSL+PRD+Q AA KAAHM                            +DL    S   ++L EI++LP+
Subjt:  MAARAHDVATFSIKGSSATLNFPDLLHLLPRPVSLAPRDVQAAAAKAAHMHN--------------------------LDLDDHHSHVDDDLTEIIQLPT

Query:  LG
        LG
Subjt:  LG

AT5G25810.1 Integrase-type DNA-binding superfamily protein1.2e-1374Show/hide
Query:  MAARAHDVATFSIKGSSATLNFPDLLHLLPRPVSLAPRDVQAAAAKAAHM
        MAARAHDVA  SIKG+SA LNFPDL    PRP SL+PRD+Q AA KAAHM
Subjt:  MAARAHDVATFSIKGSSATLNFPDLLHLLPRPVSLAPRDVQAAAAKAAHM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCCCGTGCCCACGACGTCGCAACCTTCAGCATCAAAGGCTCCTCCGCCACCCTCAATTTCCCCGACCTCCTCCACCTCCTCCCCCGCCCCGTCTCCCTCGCCCC
TCGCGACGTCCAAGCCGCCGCCGCCAAGGCCGCCCACATGCACAACCTCGACCTCGACGACCACCACTCCCACGTCGACGACGACCTCACCGAGATCATCCAACTCCCCA
CCCTCGGCCTCCGATCCATCCACCGAAAAAAATGCTCAGACCTATTTAGCGTCCTCAATCATTCTCATGGACAACCGTCTAACCGTACCACATTTGATTATCAAGGGTAC
ATAATATTCATGAAAACCACGTCTTCTGAAACGGTTGGGGATTGTCTTGCAAATGGAAAGAGAGGCATCCTTTTAAAGCTTGATTATGAAAAGGCTTATGATAAAGTTGA
TTGGGATTACTTGGACTTCATCTTGAAAAAGAAAGGCTTTGGGAATAGATGGAGAAGATGGGTGAGGGGATGTTTGACTACGACAAATTTCTCAATTTTTCTAAATGGTA
GATCCCAGGGGAAGATTGTGGCTAAGAGGGGGCTAAGACAAGGGGACCCCTTGTCCCCTTTTCTTTTCACGTTGATGGTGGATTCCTTTAGTAGATTAATCCATTTTTGC
TGTGAGAAGAGAGTTCTTAGAGGCCTATCCTTAAACGTCTCGAAGACTTCTATTATAGGCCTTAATGTGTCTAAAGAGTGGTTGTCTGATCTTGCTGCTAGATTTGGGTG
TAAGATTGAATCTCTGTCGTTTAAATGTTTGGGCTTTCCTCTTGGAGGAAACCACAGAAGAGGCCTCTTTGACTGGGAGATTCCTAGTTGGGTTGGGCTGATTGAGATCC
TCGATACTATTAGCTTGGGTGGTGGCAGAGACGTTTGTTTATGGAGTCTTGAGAAATCTGGGATCTTCTCCTCCAAATCGGCCTTTTGGAAATTATCAGATTCCTCCCCC
TCCCTCAAGACCCCGTTGATCGATGCCATCTGGAAGAACCATGCCCCAAAGAAGGTTAAGGTTTTTCTTTGGTCTGGTCCCTTGCCCATAGAAGCCTTAACACTTCAGAC
CTTCCTTAGAAAAAGTGCAGAAATAGAATGCTCTCCCCTTTGGTGTGCCCCCTGTGCTGGGCAAATTCGGAATCCCTTGACCATCTTTTCCTCCTTCGTCCTTTTGCTGC
TGCGAGTTGGGGTGGCTGCTGGGGGAATTCAACCTCTCCATTCCCTTGTCGGATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGCCCGTGCCCACGACGTCGCAACCTTCAGCATCAAAGGCTCCTCCGCCACCCTCAATTTCCCCGACCTCCTCCACCTCCTCCCCCGCCCCGTCTCCCTCGCCCC
TCGCGACGTCCAAGCCGCCGCCGCCAAGGCCGCCCACATGCACAACCTCGACCTCGACGACCACCACTCCCACGTCGACGACGACCTCACCGAGATCATCCAACTCCCCA
CCCTCGGCCTCCGATCCATCCACCGAAAAAAATGCTCAGACCTATTTAGCGTCCTCAATCATTCTCATGGACAACCGTCTAACCGTACCACATTTGATTATCAAGGGTAC
ATAATATTCATGAAAACCACGTCTTCTGAAACGGTTGGGGATTGTCTTGCAAATGGAAAGAGAGGCATCCTTTTAAAGCTTGATTATGAAAAGGCTTATGATAAAGTTGA
TTGGGATTACTTGGACTTCATCTTGAAAAAGAAAGGCTTTGGGAATAGATGGAGAAGATGGGTGAGGGGATGTTTGACTACGACAAATTTCTCAATTTTTCTAAATGGTA
GATCCCAGGGGAAGATTGTGGCTAAGAGGGGGCTAAGACAAGGGGACCCCTTGTCCCCTTTTCTTTTCACGTTGATGGTGGATTCCTTTAGTAGATTAATCCATTTTTGC
TGTGAGAAGAGAGTTCTTAGAGGCCTATCCTTAAACGTCTCGAAGACTTCTATTATAGGCCTTAATGTGTCTAAAGAGTGGTTGTCTGATCTTGCTGCTAGATTTGGGTG
TAAGATTGAATCTCTGTCGTTTAAATGTTTGGGCTTTCCTCTTGGAGGAAACCACAGAAGAGGCCTCTTTGACTGGGAGATTCCTAGTTGGGTTGGGCTGATTGAGATCC
TCGATACTATTAGCTTGGGTGGTGGCAGAGACGTTTGTTTATGGAGTCTTGAGAAATCTGGGATCTTCTCCTCCAAATCGGCCTTTTGGAAATTATCAGATTCCTCCCCC
TCCCTCAAGACCCCGTTGATCGATGCCATCTGGAAGAACCATGCCCCAAAGAAGGTTAAGGTTTTTCTTTGGTCTGGTCCCTTGCCCATAGAAGCCTTAACACTTCAGAC
CTTCCTTAGAAAAAGTGCAGAAATAGAATGCTCTCCCCTTTGGTGTGCCCCCTGTGCTGGGCAAATTCGGAATCCCTTGACCATCTTTTCCTCCTTCGTCCTTTTGCTGC
TGCGAGTTGGGGTGGCTGCTGGGGGAATTCAACCTCTCCATTCCCTTGTCGGATAA
Protein sequenceShow/hide protein sequence
MAARAHDVATFSIKGSSATLNFPDLLHLLPRPVSLAPRDVQAAAAKAAHMHNLDLDDHHSHVDDDLTEIIQLPTLGLRSIHRKKCSDLFSVLNHSHGQPSNRTTFDYQGY
IIFMKTTSSETVGDCLANGKRGILLKLDYEKAYDKVDWDYLDFILKKKGFGNRWRRWVRGCLTTTNFSIFLNGRSQGKIVAKRGLRQGDPLSPFLFTLMVDSFSRLIHFC
CEKRVLRGLSLNVSKTSIIGLNVSKEWLSDLAARFGCKIESLSFKCLGFPLGGNHRRGLFDWEIPSWVGLIEILDTISLGGGRDVCLWSLEKSGIFSSKSAFWKLSDSSP
SLKTPLIDAIWKNHAPKKVKVFLWSGPLPIEALTLQTFLRKSAEIECSPLWCAPCAGQIRNPLTIFSSFVLLLLRVGVAAGGIQPLHSLVG