; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038861 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038861
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr2:29348797..29350047
RNA-Seq ExpressionLag0038861
SyntenyLag0038861
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAY61101.1 hypothetical protein CUMW_207140, partial [Citrus unshiu]2.9e-2726.9Show/hide
Query:  VNELLLSDNSWNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGYRMACLLRERASSSNVESIGRWWNFLWNRRIPSKIKILCWRRN
        V+EL+  +N W+  L+   F + DA+VI QIP PR   +D+L+W + K+G YTVKSGY+ A  +R  A  S+ E     WN +W+  +P KI+I  WR  
Subjt:  VNELLLSDNSWNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGYRMACLLRERASSSNVESIGRWWNFLWNRRIPSKIKILCWRRN

Query:  KEVFG----------------------------------------------------------------ERSAGGSEED---GWVWVSEYLSHFRAFCGR
        K +                                                                  +R    ++ D     +W      +   F G+
Subjt:  KEVFG----------------------------------------------------------------ERSAGGSEED---GWVWVSEYLSHFRAFCGR

Query:  R------IAGGLARREGIR-------------WLPPNSLNYKLNTNAAVCRETNSSSLGAIIRDEEGRVMLTSMKLVQYVQDVDALKAMVIRDSLIVAKE
        R      +A   A  E  +             W PP     K+NT+AA   E N + LGA+IRDE G+V  T++K+ ++   V   +A  +   L VAK+
Subjt:  R------IAGGLARREGIR-------------WLPPNSLNYKLNTNAAVCRETNSSSLGAIIRDEEGRVMLTSMKLVQYVQDVDALKAMVIRDSLIVAKE

Query:  AGFRRLEVETDSARVAAMIRSNQKNCSEVGVLVQEISQISKEFLFCSVSWCRREANLVAHTAARQVLE
        A  + + +E+DS  V +++ + Q + SE+  +V EI ++ + F   S  +  R  N +AH+ A+  LE
Subjt:  AGFRRLEVETDSARVAAMIRSNQKNCSEVGVLVQEISQISKEFLFCSVSWCRREANLVAHTAARQVLE

XP_024033483.1 uncharacterized protein LOC112095606 [Citrus clementina]5.4e-2926.69Show/hide
Query:  VNELLLSDNSWNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGYRMACLLRERASSSNVESIGRWWNFLWNRRIPSKIKILCWRRN
        V EL+  +N W+   +   F   DA+ IV+IP PR   +D ++W Y+K GLY+VKSGY+ A  L+  A   +  S    WN +W   +P KI+I  WR  
Subjt:  VNELLLSDNSWNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGYRMACLLRERASSSNVESIGRWWNFLWNRRIPSKIKILCWRRN

Query:  KEVFGERSAGGSEEDGW--------------------------------VW--------VSEYLSHFRAFCGRR------IAGGLARREGIR--------
        K +        S E+ W                                +W        + +   +   F G+R      +A   A  E  +        
Subjt:  KEVFGERSAGGSEEDGW--------------------------------VW--------VSEYLSHFRAFCGRR------IAGGLARREGIR--------

Query:  ------------WLPPNSLNYKLNTNAAVCRETNSSSLGAIIRDEEGRVMLTSMKLVQYVQDVDALKAMVIRDSLIVAKEAGFRRLEVETDSARVAAMIR
                    W PP S   K+N +AA   E + + LGAIIRD+ G V+  ++K+ ++  DV   +A  +   L +A+ A  + L VE+D+  V  ++ 
Subjt:  ------------WLPPNSLNYKLNTNAAVCRETNSSSLGAIIRDEEGRVMLTSMKLVQYVQDVDALKAMVIRDSLIVAKEAGFRRLEVETDSARVAAMIR

Query:  SNQKNCSEVGVLVQEISQISKEFLFCSVSWCRREANLVAHTAARQVLELGVQGTWL
        + Q   SE+  ++ EI  + + F   S+++  R  N +AH+ A+  LE      W+
Subjt:  SNQKNCSEVGVLVQEISQISKEFLFCSVSWCRREANLVAHTAARQVLELGVQGTWL

XP_024956542.1 uncharacterized protein LOC112498908 [Citrus sinensis]6.5e-2726.4Show/hide
Query:  VNELLLSDNSWNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGYRMACLLRERASSSNVESIGRWWNFLWNRRIPSKIKILCWRRN
        V+EL+  +N W+  L+   F + DA+VI QIP PR + +D+L+W + K+G YTVKSGY+ A  +R  A  S+ ES    WN +W+  +P KI+I  WR  
Subjt:  VNELLLSDNSWNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGYRMACLLRERASSSNVESIGRWWNFLWNRRIPSKIKILCWRRN

Query:  KEVFG----------------------------------------------------------------ERSAGGSEEDGW---VWVSEYLSHFRAFCGR
        K +                                                                  +R    ++ D +   +W      +   F G+
Subjt:  KEVFG----------------------------------------------------------------ERSAGGSEEDGW---VWVSEYLSHFRAFCGR

Query:  R------IAGGLARREGIR--------------------WLPPNSLNYKLNTNAAVCRETNSSSLGAIIRDEEGRVMLTSMKLVQYVQDVDALKAMVIRD
        R      +A   A  E  +                    W PP     K+NT+AA   E N + LGA+IRDE G+V  T++K+ ++   V   +A  +  
Subjt:  R------IAGGLARREGIR--------------------WLPPNSLNYKLNTNAAVCRETNSSSLGAIIRDEEGRVMLTSMKLVQYVQDVDALKAMVIRD

Query:  SLIVAKEAGFRRLEVETDSARVAAMIRSNQKNCSEVGVLVQEISQISKEFLFCSVSWCRREANLVAHTAARQVLE
         L VAK+A  + + +E+DS  V +++ + Q + SE+  +V EI ++ + F   S  +  R  N +AH+  +  LE
Subjt:  SLIVAKEAGFRRLEVETDSARVAAMIRSNQKNCSEVGVLVQEISQISKEFLFCSVSWCRREANLVAHTAARQVLE

XP_030479476.1 uncharacterized protein LOC115696730 [Cannabis sativa]2.1e-2529.49Show/hide
Query:  MKVNELLLSDNSWNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGYRMACLLRERASSSNVESIGRWWNFLWNRRIPSKIKILCWR
        + V   +     WN+ L+ + FQ  D + IV IP     S D+L+W +   G YTV SG+ +A  L E   +S   +   WW   W+  +PSK K     
Subjt:  MKVNELLLSDNSWNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGYRMACLLRERASSSNVESIGRWWNFLWNRRIPSKIKILCWR

Query:  RNKEVFGERSAGGSEEDGWVWVSEYLSHFRAFCGRRIAGGLARREGIRWLPPNSLNYKLNTNAAVCRETNSSSLGAIIRDEEGRVMLTSMKLVQYVQDVD
        +  ++                      H+R     R A   +  + + W PP  L  K+N +AAV +E     +GAIIRD  G V+    K VQ     D
Subjt:  RNKEVFGERSAGGSEEDGWVWVSEYLSHFRAFCGRRIAGGLARREGIRWLPPNSLNYKLNTNAAVCRETNSSSLGAIIRDEEGRVMLTSMKLVQYVQDVD

Query:  ALKAMVIRDSLIVAKEAGFRRLEVETDSARVAAMIRSNQKNCSEVGVLVQEISQISKEFLFCSVSWCRREANLVAHTAARQVLELGVQGTWLEEV
         L+A  +  SL  AK+   +   VETD+ RV++ I S  +N S    L+ ++  +   F   +VS  +R AN  A+  A+  L L     W+ E+
Subjt:  ALKAMVIRDSLIVAKEAGFRRLEVETDSARVAAMIRSNQKNCSEVGVLVQEISQISKEFLFCSVSWCRREANLVAHTAARQVLELGVQGTWLEEV

XP_042965938.1 uncharacterized protein LOC122299618 [Carya illinoinensis]4.7e-2526.35Show/hide
Query:  LLSDNSWNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGYRMACLLRERASSSNV---ESIGRWWNFLWNRRIPSKIKILCWRRNK
        L+S+  W+V L++++F++ + E I  IP  +  ++DKL+W     G +T++S Y++     E A   N    E   RW + +W+  I  K+K+  WR  K
Subjt:  LLSDNSWNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGYRMACLLRERASSSNV---ESIGRWWNFLWNRRIPSKIKILCWRRNK

Query:  EVFGERS-------AGGSEEDGWVWVSEYLSHFRAFCGRRI---AGGL--------------ARREGIRWLPPNSLNYKLNTNAAVCRETNSSSLGAIIR
         +   RS          S+        E  SH    C   I   A  L              + R  +RW  P     K N +AAV ++     +G +IR
Subjt:  EVFGERS-------AGGSEEDGWVWVSEYLSHFRAFCGRRI---AGGL--------------ARREGIRWLPPNSLNYKLNTNAAVCRETNSSSLGAIIR

Query:  DEEGRVMLTSMKLVQYVQDVDALKAMVIRDSLIVAKEAGFRRLEVETDSARVAAMIRSNQKNCSEVGVLVQEISQISKEFLFCSVSWCRREANLVAHTAA
        DEEG V++ + + + Y+ D    ++  +R +L V ++  F  +  E D+  +   + +  ++ S  G +++++  + K      VS+  REAN  AH  A
Subjt:  DEEGRVMLTSMKLVQYVQDVDALKAMVIRDSLIVAKEAGFRRLEVETDSARVAAMIRSNQKNCSEVGVLVQEISQISKEFLFCSVSWCRREANLVAHTAA

Query:  RQVLELGVQGTWLEE
        R VL    +  W+E+
Subjt:  RQVLELGVQGTWLEE

TrEMBL top hitse value%identityAlignment
A0A2H5Q972 Uncharacterized protein (Fragment)1.4e-2726.9Show/hide
Query:  VNELLLSDNSWNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGYRMACLLRERASSSNVESIGRWWNFLWNRRIPSKIKILCWRRN
        V+EL+  +N W+  L+   F + DA+VI QIP PR   +D+L+W + K+G YTVKSGY+ A  +R  A  S+ E     WN +W+  +P KI+I  WR  
Subjt:  VNELLLSDNSWNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGYRMACLLRERASSSNVESIGRWWNFLWNRRIPSKIKILCWRRN

Query:  KEVFG----------------------------------------------------------------ERSAGGSEED---GWVWVSEYLSHFRAFCGR
        K +                                                                  +R    ++ D     +W      +   F G+
Subjt:  KEVFG----------------------------------------------------------------ERSAGGSEED---GWVWVSEYLSHFRAFCGR

Query:  R------IAGGLARREGIR-------------WLPPNSLNYKLNTNAAVCRETNSSSLGAIIRDEEGRVMLTSMKLVQYVQDVDALKAMVIRDSLIVAKE
        R      +A   A  E  +             W PP     K+NT+AA   E N + LGA+IRDE G+V  T++K+ ++   V   +A  +   L VAK+
Subjt:  R------IAGGLARREGIR-------------WLPPNSLNYKLNTNAAVCRETNSSSLGAIIRDEEGRVMLTSMKLVQYVQDVDALKAMVIRDSLIVAKE

Query:  AGFRRLEVETDSARVAAMIRSNQKNCSEVGVLVQEISQISKEFLFCSVSWCRREANLVAHTAARQVLE
        A  + + +E+DS  V +++ + Q + SE+  +V EI ++ + F   S  +  R  N +AH+ A+  LE
Subjt:  AGFRRLEVETDSARVAAMIRSNQKNCSEVGVLVQEISQISKEFLFCSVSWCRREANLVAHTAARQVLE

A0A2N9GIC4 Reverse transcriptase domain-containing protein2.6e-2925Show/hide
Query:  VNELLL-SDNSWNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGYRMACLLRERA--SSSNVESIGRWWNFLWNRRIPSKIKILCW
        V++L++ +  +W+  L+ ++F  YDAE I QIP       DK++W    NG YTV+SGYR      +++   SS    +   W  +W+ +IP K ++  W
Subjt:  VNELLL-SDNSWNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGYRMACLLRERA--SSSNVESIGRWWNFLWNRRIPSKIKILCW

Query:  RRNKEVFGER---------------SAGGSEEDGW--VWVSEYLSHFRAFCGRRIAGGLARREG---------------IRWLPPNSLNYKLNTNAAVCR
        + ++E    +                 G  +ED    +W  +   H ++   + +   + R  G               +RW+P     YK+N + AV +
Subjt:  RRNKEVFGER---------------SAGGSEEDGW--VWVSEYLSHFRAFCGRRIAGGLARREG---------------IRWLPPNSLNYKLNTNAAVCR

Query:  ETNSSSLGAIIRDEEGRVMLTSMKLVQYVQDVDALKAMVIRDSLIVAKEAGFRRLEVETDSARVAAMIRSNQKNCSEVGVLVQEISQISKEFLFCSVSWC
        ETN++ +G I+RD    VM +  + V++   + +++A  ++ S+    E G    E E DS  + A +   + + +  G+L+ +   ++ +    S S  
Subjt:  ETNSSSLGAIIRDEEGRVMLTSMKLVQYVQDVDALKAMVIRDSLIVAKEAGFRRLEVETDSARVAAMIRSNQKNCSEVGVLVQEISQISKEFLFCSVSWC

Query:  RREANLVAHTAARQVLELGVQGTWLEEVLVSLDEVY
        +R+ N +AH  AR+ L       W+E V   L+ +Y
Subjt:  RREANLVAHTAARQVLELGVQGTWLEEVLVSLDEVY

A0A2N9HEC7 Reverse transcriptase domain-containing protein5.4e-2726.16Show/hide
Query:  VNELLLSDN-SWNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGYRMACLLRER--ASSSNVESIGRWWNFLWNRRIPSKIKILCW
        V++L++    +W+  L+ S+F  YDAE I QIP       DK++W    NG YTV+SGYR      ++    SS    +   W  +W+ +IP K ++  W
Subjt:  VNELLLSDN-SWNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGYRMACLLRER--ASSSNVESIGRWWNFLWNRRIPSKIKILCW

Query:  RRNKEVFGER---------------SAGGSEEDG----W-------VWVSEYLSHFRA--------FCGRRIAGGLAR--RE------------GIRWLP
        + ++EV   +                 G  +ED     W       VW +E  +  R           G +    L    RE             +RW+P
Subjt:  RRNKEVFGER---------------SAGGSEEDG----W-------VWVSEYLSHFRA--------FCGRRIAGGLAR--RE------------GIRWLP

Query:  PNSLNYKLNTNAAVCRETNSSSLGAIIRDEEGRVMLTSMKLVQYVQDVDALKAMVIRDSLIVAKEAGFRRLEVETDSARVAAMIRSNQKNCSEVGVLVQE
             YK+N + AV +ETN++ +G I+RD  G VM +  + V++   V +++A  ++ S+    E G    + E DS  + A +   + + +  G+L+ +
Subjt:  PNSLNYKLNTNAAVCRETNSSSLGAIIRDEEGRVMLTSMKLVQYVQDVDALKAMVIRDSLIVAKEAGFRRLEVETDSARVAAMIRSNQKNCSEVGVLVQE

Query:  ISQISKEFLFCSVSWCRREANLVAHTAARQVLELGVQGTWLEEV
           ++ +    S S  +R+ N +AH  AR+ L       W+E V
Subjt:  ISQISKEFLFCSVSWCRREANLVAHTAARQVLELGVQGTWLEEV

A0A803NU77 Uncharacterized protein8.3e-2828.25Show/hide
Query:  KVNELLLSDNSWNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGYRMACLLRERASSSNVESIGRWWNFLWNRRIPSKIKILCWRR
        +V+ L+  +  WN+ L+ + F   D + I+QIP     + D+L+W YE NG YTVKSGY +A  L E+  + +      WWN  W+  +PSK++I  WR 
Subjt:  KVNELLLSDNSWNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGYRMACLLRERASSSNVESIGRWWNFLWNRRIPSKIKILCWRR

Query:  NKEVFGERSA--GGSEEDGWVWVSEYLSHFRAFCGRRIA-------GGLARREGIRWLPPNSLNYKLNTNAAVCRETNSSSLGAIIRDEEGRVMLTSMKL
          +     +        D  + +    ++F+A      A         L +++   WL P +   KLNT+AA+ +ET ++  GAI+R+ +G V+    K 
Subjt:  NKEVFGERSA--GGSEEDGWVWVSEYLSHFRAFCGRRIA-------GGLARREGIRWLPPNSLNYKLNTNAAVCRETNSSSLGAIIRDEEGRVMLTSMKL

Query:  VQYVQDVDALKAMVIRDSLIVAKEAGFRRLEVETDSARVAAMIRSNQKNCSEVGVLVQEISQISKEFLFCSVSWCRREANLVAHTAARQVLELGVQGTWL
        V      + ++A+ +   L    + G     +ETDS  VA  ++S   + S    L+ +I+ +   F    +    R AN  AH   +  L +    +WL
Subjt:  VQYVQDVDALKAMVIRDSLIVAKEAGFRRLEVETDSARVAAMIRSNQKNCSEVGVLVQEISQISKEFLFCSVSWCRREANLVAHTAARQVLELGVQGTWL

Query:  EEVLVSLD
        EE+ + L+
Subjt:  EEVLVSLD

A0A803PC16 Uncharacterized protein2.4e-2728.34Show/hide
Query:  MKVNELLLSDNSWNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGYRMACLLRERASSSNVESIGRWWNFLWNRRIPSKIKILCWR
        MKV++L+L+   WN GL+ ++F   D  +I  IP       D ++W +E  G+Y+VKSGY +A  L E+   S+     +WW   W  ++PSKI+I  WR
Subjt:  MKVNELLLSDNSWNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGYRMACLLRERASSSNVESIGRWWNFLWNRRIPSKIKILCWR

Query:  RNKE-------VFGERSAGGSEEDGWVWVSEYLSHFRAFCGR----------RIAGG--LARREGIRWLPPNSLNYKLNTNAAVCRETNSSSLGAIIRDE
           +       +     A  S        +E + H   +C R          R++ G   A  +   WL P S   KLNT+AA+    N S  GA++RD 
Subjt:  RNKE-------VFGERSAGGSEEDGWVWVSEYLSHFRAFCGR----------RIAGG--LARREGIRWLPPNSLNYKLNTNAAVCRETNSSSLGAIIRDE

Query:  EGRVMLTSMKLVQYVQDVDALKAMVIRDSLIVAKEAGFRRLEVETDSARVAAMIRSNQKNCSEVGVLVQEISQISKEFLFCSVSWCRREANLVAHTAARQ
         G++              D  + + +  +L   K+       +ETDS  V   + S+++  S+   L+  IS +   F    ++   R AN  AH  AR 
Subjt:  EGRVMLTSMKLVQYVQDVDALKAMVIRDSLIVAKEAGFRRLEVETDSARVAAMIRSNQKNCSEVGVLVQEISQISKEFLFCSVSWCRREANLVAHTAARQ

Query:  VLELGVQGTWLEEV
         L +    +W+EE+
Subjt:  VLELGVQGTWLEEV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G22440.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT4G29090.1)8.3e-0435.48Show/hide
Query:  VNELL-LSDNSWNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGYRMA
        VN+L+  + N W +  +Q++    D  +I+ I   R    D   W + K+G YTVKSGY +A
Subjt:  VNELL-LSDNSWNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGYRMA

AT3G09510.1 Ribonuclease H-like superfamily protein6.4e-0427.52Show/hide
Query:  MKVNELLLSDNS---WNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGYRMACLLRERASSSNVESIG------RWWNFLWNRRIP
        M +N L     S   W+   +     + D   I +I   +    DK++W Y   G YTV+SGY     L     S+N+ +I            +WN  I 
Subjt:  MKVNELLLSDNS---WNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGYRMACLLRERASSSNVESIG------RWWNFLWNRRIP

Query:  SKIKILCWR
         K+K   WR
Subjt:  SKIKILCWR

AT4G29090.1 Ribonuclease H-like superfamily protein3.4e-0525.96Show/hide
Query:  MKVNELL-LSDNSWNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGY-RMACLLRERASSSNVE--SIGRWWNFLWNRRIPSKIKI
        +KV++L+  S   W   +++ +F E + ++I ++        D   W Y  +G YTVKSGY  +  ++ +R+S   V   S+   +  +W  +   KI+ 
Subjt:  MKVNELL-LSDNSWNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGY-RMACLLRERASSSNVE--SIGRWWNFLWNRRIPSKIKI

Query:  LCWR
          W+
Subjt:  LCWR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGTTAATGAGTTACTGCTGTCGGATAACTCATGGAATGTGGGCCTAGTCCAGTCAGTGTTCCAGGAATATGATGCGGAAGTTATTGTGCAAATACCAAGGCCACG
GTGCATGAGCCAGGATAAGCTTGTTTGGCAGTACGAGAAAAATGGATTGTACACAGTTAAGAGTGGCTATAGAATGGCTTGTTTGTTGAGGGAGAGGGCAAGTAGCTCTA
ATGTGGAATCGATTGGGAGGTGGTGGAATTTTCTGTGGAATAGGCGGATACCGAGCAAAATCAAGATATTATGCTGGAGGCGGAATAAGGAGGTTTTTGGTGAAAGAAGT
GCTGGAGGGAGTGAGGAGGATGGTTGGGTTTGGGTATCTGAGTACCTATCCCATTTCAGGGCTTTCTGTGGTAGGAGGATTGCTGGGGGTTTGGCCCGAAGGGAAGGGAT
ACGGTGGTTGCCTCCTAATTCACTAAATTACAAACTCAACACAAATGCAGCAGTATGTAGAGAGACAAATTCAAGCAGTCTAGGGGCTATTATTCGGGATGAAGAAGGGA
GAGTTATGCTTACCTCGATGAAGTTGGTCCAATATGTGCAGGATGTGGACGCGCTAAAGGCAATGGTGATCCGCGACAGTCTGATAGTTGCGAAAGAAGCGGGCTTCCGA
CGACTGGAGGTGGAGACTGATTCAGCTCGGGTGGCGGCCATGATTCGGTCAAACCAGAAGAATTGCTCTGAGGTGGGAGTTCTGGTTCAGGAGATAAGTCAGATCTCGAA
GGAATTTTTGTTCTGTTCTGTGAGCTGGTGCCGGCGGGAGGCTAATCTGGTGGCGCACACGGCGGCGCGGCAAGTGCTGGAGCTTGGAGTTCAAGGCACCTGGTTAGAAG
AGGTACTGGTGTCGTTGGACGAGGTTTATCGCAGAGAGCGTTTGGACAGTAGAGGAGAGAGATCGAGTTGCTTGTCTGAGGGCTTTTGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGGTTAATGAGTTACTGCTGTCGGATAACTCATGGAATGTGGGCCTAGTCCAGTCAGTGTTCCAGGAATATGATGCGGAAGTTATTGTGCAAATACCAAGGCCACG
GTGCATGAGCCAGGATAAGCTTGTTTGGCAGTACGAGAAAAATGGATTGTACACAGTTAAGAGTGGCTATAGAATGGCTTGTTTGTTGAGGGAGAGGGCAAGTAGCTCTA
ATGTGGAATCGATTGGGAGGTGGTGGAATTTTCTGTGGAATAGGCGGATACCGAGCAAAATCAAGATATTATGCTGGAGGCGGAATAAGGAGGTTTTTGGTGAAAGAAGT
GCTGGAGGGAGTGAGGAGGATGGTTGGGTTTGGGTATCTGAGTACCTATCCCATTTCAGGGCTTTCTGTGGTAGGAGGATTGCTGGGGGTTTGGCCCGAAGGGAAGGGAT
ACGGTGGTTGCCTCCTAATTCACTAAATTACAAACTCAACACAAATGCAGCAGTATGTAGAGAGACAAATTCAAGCAGTCTAGGGGCTATTATTCGGGATGAAGAAGGGA
GAGTTATGCTTACCTCGATGAAGTTGGTCCAATATGTGCAGGATGTGGACGCGCTAAAGGCAATGGTGATCCGCGACAGTCTGATAGTTGCGAAAGAAGCGGGCTTCCGA
CGACTGGAGGTGGAGACTGATTCAGCTCGGGTGGCGGCCATGATTCGGTCAAACCAGAAGAATTGCTCTGAGGTGGGAGTTCTGGTTCAGGAGATAAGTCAGATCTCGAA
GGAATTTTTGTTCTGTTCTGTGAGCTGGTGCCGGCGGGAGGCTAATCTGGTGGCGCACACGGCGGCGCGGCAAGTGCTGGAGCTTGGAGTTCAAGGCACCTGGTTAGAAG
AGGTACTGGTGTCGTTGGACGAGGTTTATCGCAGAGAGCGTTTGGACAGTAGAGGAGAGAGATCGAGTTGCTTGTCTGAGGGCTTTTGTTAG
Protein sequenceShow/hide protein sequence
MKVNELLLSDNSWNVGLVQSVFQEYDAEVIVQIPRPRCMSQDKLVWQYEKNGLYTVKSGYRMACLLRERASSSNVESIGRWWNFLWNRRIPSKIKILCWRRNKEVFGERS
AGGSEEDGWVWVSEYLSHFRAFCGRRIAGGLARREGIRWLPPNSLNYKLNTNAAVCRETNSSSLGAIIRDEEGRVMLTSMKLVQYVQDVDALKAMVIRDSLIVAKEAGFR
RLEVETDSARVAAMIRSNQKNCSEVGVLVQEISQISKEFLFCSVSWCRREANLVAHTAARQVLELGVQGTWLEEVLVSLDEVYRRERLDSRGERSSCLSEGFC