; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011089 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011089
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr1:14106828..14110555
RNA-Seq ExpressionLag0011089
SyntenyLag0011089
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR025558 - Domain of unknown function DUF4283
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG2711776.1 hypothetical protein I3760_04G092800 [Carya illinoinensis]5.2e-8726.02Show/hide
Query:  LTERETHAIEAVDNSRL-FAAEEIDLCLAAKVLSSKYINADAFRNMMKSVWRVHQNTRIEPAGDNIFLIKFRSKLEKARVVKSGPWNFDKSLIVMVSPRE
        LTE+E  A+ +V+ SRL    E    CL  K+L  K+ N +AF+  M+ +WR  +  +         L++F    +K +V++ GPW+FDK L+++     
Subjt:  LTERETHAIEAVDNSRL-FAAEEIDLCLAAKVLSSKYINADAFRNMMKSVWRVHQNTRIEPAGDNIFLIKFRSKLEKARVVKSGPWNFDKSLIVMVSPRE

Query:  NDQVGELDFTKESFWIQVHNVPLNCMTVDMAKLLGEVVGVIEEIEWD-GQNGWTGPIVRIRVVVDILKPLQRVVKIKTKDGKEVW---------------
          Q+G+++     FW+++H++PL      + +L+G  +G + E++ D G+  W G  +R+RV+++I KPL R  ++  ++G   W               
Subjt:  NDQVGELDFTKESFWIQVHNVPLNCMTVDMAKLLGEVVGVIEEIEWD-GQNGWTGPIVRIRVVVDILKPLQRVVKIKTKDGKEVW---------------

Query:  ----------------------CPIHVWCS----EDERSGG--GGFEIEGRGGDGGRDQLEGEVGVAREGGDKGGGGSDEVIEP---------------G
                               P   W         RSGG                D +E    V+++  +  G G++ V EP                
Subjt:  ----------------------CPIHVWCS----EDERSGG--GGFEIEGRGGDGGRDQLEGEVGVAREGGDKGGGGSDEVIEP---------------G

Query:  KGGTE---------NEPVRLGSREGGVGDAVQEGTE--------------------------------------VVDKGKGLWVEGGGVERGNEVRNGTD
        KG +          NE V +      VG   +E +E                                      +V K  G       V+R    R G+ 
Subjt:  KGGTE---------NEPVRLGSREGGVGDAVQEGTE--------------------------------------VVDKGKGLWVEGGGVERGNEVRNGTD

Query:  LIIKDVSGEVERRVGEGAVGVGINVSEG-----------GWEEVEEEG-EGWGSGDGLRRGEPVGQE-----VSEYGGKKD---EERERSEPLKKKVESV
         + K  +   E R+  G     +  SEG           G   VE E  +G      L R   + +E     +S  G  +D   E     EP +    +V
Subjt:  LIIKDVSGEVERRVGEGAVGVGINVSEG-----------GWEEVEEEG-EGWGSGDGLRRGEPVGQE-----VSEYGGKKD---EERERSEPLKKKVESV

Query:  GKSGGFCLFWKGGLDVVLRSMSRFHIDVMVRADEG-HWRFTGFYGDPELSNR----------------------------------GGSVRNQRQMQAFR
        G+SGG  LFWK  + + + S SR HI   ++  +G  W  TG YG PE   R                                  GG++R++ QM+ FR
Subjt:  GKSGGFCLFWKGGLDVVLRSMSRFHIDVMVRADEG-HWRFTGFYGDPELSNR----------------------------------GGSVRNQRQMQAFR

Query:  DAIDDCGLSDMGFRGHPFTWE-------IVEKR--------------------------------WNDP-----------------------------GH
          + DC L D+G+ G PFTW        +V++R                                W D                                
Subjt:  DAIDDCGLSDMGFRGHPFTWE-------IVEKR--------------------------------WNDP-----------------------------GH

Query:  VHCLNQGSL-MDAWKVRVKGCADVLSGWGRTRKGNYASKISLARQQLQVAMSNQG-------------DVQETLD-------------------------
          C   GS+ +D    R+  CA  L  W +T  G+    ++ A+++LQ   +N               +VQ+ L+                         
Subjt:  VHCLNQGSL-MDAWKVRVKGCADVLSGWGRTRKGNYASKISLARQQLQVAMSNQG-------------DVQETLD-------------------------

Query:  ---GSTKGH--------------PRGE--------------------ELMEFLAHIQPCVSESDNALLRKPFSEEELLCSLKQMGPTKAPGEDGLPALFY
            ST+                 +G+                    ++ + L+ ++  V+   N  L KP+  EE+  +LKQM P+KAPG DG+  LF+
Subjt:  ---GSTKGH--------------PRGE--------------------ELMEFLAHIQPCVSESDNALLRKPFSEEELLCSLKQMGPTKAPGEDGLPALFY

Query:  QKFWGIVG---------------------------------------------------KETQSAFVPARNVCDNVILGFECLHYIRQRKRGGEGWAALK
        QK+WG++G                                                    ++QSAFVP R + DNV++ +E LH++R +++G +G+ +LK
Subjt:  QKFWGIVG---------------------------------------------------KETQSAFVPARNVCDNVILGFECLHYIRQRKRGGEGWAALK

Query:  LDMSKAYDRVEWFFLEKLMLALGMDVKVVALIMRCVRTVSYSFLVNGKRVGCLTPSRGIRQGDPISPYLFLLCAEGLSRMLSWLESEGKVSGVKIARGSP
        LDMS AYDRVEW FLEK+M +LG D K+++L+M CVRTVS+S LVNG   G + PSRG+RQGDP+SPYLFLLC EGL  +L        V G++I RG+P
Subjt:  LDMSKAYDRVEWFFLEKLMLALGMDVKVVALIMRCVRTVSYSFLVNGKRVGCLTPSRGIRQGDPISPYLFLLCAEGLSRMLSWLESEGKVSGVKIARGSP

Query:  SISHLFFADDCFLFFKAKQSEARWVSKSLSAYSILTGQEVNLAKSGVCFSPNVGVQAKQAIASSLGVQLVGFHDRYLGLP
         I+HL FADD     KA  S    +   LS Y   +GQ +N  K+ + FS NV    K+ I    G      +++YLGLP
Subjt:  SISHLFFADDCFLFFKAKQSEARWVSKSLSAYSILTGQEVNLAKSGVCFSPNVGVQAKQAIASSLGVQLVGFHDRYLGLP

OMO61345.1 reverse transcriptase [Corchorus capsularis]9.5e-9730.32Show/hide
Query:  KKWELTERETHAIEAVDNSRLF--AAEEIDLCLAAKVLSSKYINADAFRNMMKSVWRVHQNTRIEPAGDNIFLIKFRSKLEKARVVKSGPWNFDKSLIVM
        + + LTE E   +E   + RL      E + CL  K+LS + +N +  RN+M  VW++    ++   G+N+F+ +F S LEK RV +  PWNF+K+L+V+
Subjt:  KKWELTERETHAIEAVDNSRLF--AAEEIDLCLAAKVLSSKYINADAFRNMMKSVWRVHQNTRIEPAGDNIFLIKFRSKLEKARVVKSGPWNFDKSLIVM

Query:  VSPRENDQVGELDFTKESFWIQVHNVPLNCMTVDMAKLLGEVVGVIEEIEWDGQNGWTGPIVRIRVVVDILKPLQRVVKIKTKDGKEV------------
         S    D V ++     SFW Q H++PL  M   + +++GE  G +EEI+  G     G  +R R  +++ KPL+R + +   +G ++            
Subjt:  VSPRENDQVGELDFTKESFWIQVHNVPLNCMTVDMAKLLGEVVGVIEEIEWDGQNGWTGPIVRIRVVVDILKPLQRVVKIKTKDGKEV------------

Query:  ----WCPIHV-----------------------WC-----------------------SEDERSGGGGFEIEGRGGDGGRDQLEGEVGVAREGGDKGGGG
             C  HV                       W                         E+E+        EG G  G + Q   ++  AR+  D     
Subjt:  ----WCPIHV-----------------------WC-----------------------SEDERSGGGGFEIEGRGGDGGRDQLEGEVGVAREGGDKGGGG

Query:  SDEVIEPGKGGTENEPVRLGSREGGVGDAVQEGTE-------VVDKGKGLWVEGGGVERGNEVRNGTDLIIKDVSGEVERRVGEGAVGVGINVSEGGWEE
         D     G   T  E V   + +  VGD+ Q+  E        V   K ++ +G    +   + +G  L  K     V + V E +   G +  +  W+ 
Subjt:  SDEVIEPGKGGTENEPVRLGSREGGVGDAVQEGTE-------VVDKGKGLWVEGGGVERGNEVRNGTDLIIKDVSGEVERRVGEGAVGVGINVSEGGWEE

Query:  VEEEGEGWGSGDGLRRGEPVGQEVSEYGGKKDEERERSEPLKKKVESVGKSGGFCLFWKGGLDVVLRSMSRFHIDVMVRADEGH--WRFTGFYGDPELSN
                  G   +  +  GQ+     G   E R++S+    +V+S  +SGG  L WK   +V + S S  H D +V   +G   WRFTGFYG+P    
Subjt:  VEEEGEGWGSGDGLRRGEPVGQEVSEYGGKKDEERERSEPLKKKVESVGKSGGFCLFWKGGLDVVLRSMSRFHIDVMVRADEGH--WRFTGFYGDPELSN

Query:  R----------------------------------GGSVRNQRQMQAFRDAIDDCGLSDMGFRGHPFTWEIVEKRWNDPGHVHCLNQGSLMDAWKVRVKG
        R                                  GGS+R   Q+Q FR+ I DC L  +   G   TW    +   +      L++  +  +W+ R   
Subjt:  R----------------------------------GGSVRNQRQMQAFRDAIDDCGLSDMGFRGHPFTWEIVEKRWNDPGHVHCLNQGSLMDAWKVRVKG

Query:  CADVLSGWGRTRKGNYASKISLARQQLQVAMSN--QGDVQETLDGSTKGHPRGEELMEFLAHIQPCVSESDNALLRKPFSEEELLCSLKQMGPTKAPGED
                   +K N   KI  AR +  V   +  Q  V       T  H R  +   F  +I+  +S     +L K +   E+  ++ QM P+KA G D
Subjt:  CADVLSGWGRTRKGNYASKISLARQQLQVAMSN--QGDVQETLDGSTKGHPRGEELMEFLAHIQPCVSESDNALLRKPFSEEELLCSLKQMGPTKAPGED

Query:  GLPALFYQKFWGIVG--------------------KETQSAFVPARNVCDNVILGFECLHYIRQRKRGGEGWAALKLDMSKAYDRVEWFFLEKLMLALGM
        G+ A+FYQK+W ++G                     E QSAFVP R + DN ++ +E +HY++ +K G  G  ALKLDMSKAYDRVEW +LE++M  +G 
Subjt:  GLPALFYQKFWGIVG--------------------KETQSAFVPARNVCDNVILGFECLHYIRQRKRGGEGWAALKLDMSKAYDRVEWFFLEKLMLALGM

Query:  DVKVVALIMRCVRTVSYSFLVNGKRVGCLTPSRGIRQGDPISPYLFLLCAEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFLFFKAKQSEARW
            ++LIM  VRTVSYS +VNGK+     PSRGIRQGDPISPYLFLLC EGLS +L+  +++G + GV + R +P I+HL FADD  LF +A  +EA  
Subjt:  DVKVVALIMRCVRTVSYSFLVNGKRVGCLTPSRGIRQGDPISPYLFLLCAEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFLFFKAKQSEARW

Query:  VSKSLSAYSILTGQEVNLAKSGVCFSPNVGVQAKQAIASSLGVQLVGFHDRYLGLPAVFLGRKALSLKYVKDQ
        +   L  Y   +GQ VN+ KS V FS N  +  ++AI   LGV      D+YLG+P +    K    +++KD+
Subjt:  VSKSLSAYSILTGQEVNLAKSGVCFSPNVGVQAKQAIASSLGVQLVGFHDRYLGLPAVFLGRKALSLKYVKDQ

XP_021771467.1 uncharacterized protein LOC110735585 [Chenopodium quinoa]1.5e-8927.69Show/hide
Query:  EELIMRLKKWELTERETHAIEAVDNSRLFAAEEIDLCLAAKVLSSKYINADAFRNMMKSVWRVHQNTRIEPAGDNIFLIKFRSKLEKARVVKSGPWNFDK
        ++L  R +K  L E+E   ++  + +     E++ L L  ++L  + +N +AF+  +   W +     I   G+N+++ +F    +K +V+   PW F++
Subjt:  EELIMRLKKWELTERETHAIEAVDNSRLFAAEEIDLCLAAKVLSSKYINADAFRNMMKSVWRVHQNTRIEPAGDNIFLIKFRSKLEKARVVKSGPWNFDK

Query:  SLIVMVSPRENDQVGELDFTKESFWIQVHNVPLNCMTVDMAKLLGEVVGVIEEIEWDGQNGWTGPIVRIRVVVDILKPLQRVVKIKTKDG-------KEV
        +L+++ S + ++Q  ++  T   FW+++ N+P NC + +  + +   +G + EIE +  +       R++V +D+ +PL+R+  IKTK+G       K  
Subjt:  SLIVMVSPRENDQVGELDFTKESFWIQVHNVPLNCMTVDMAKLLGEVVGVIEEIEWDGQNGWTGPIVRIRVVVDILKPLQRVVKIKTKDG-------KEV

Query:  WCPIH--------------VWCSEDERSGGGGFEIEGRGGDGGRDQLEGEVGVAREGGDKGGGGSDEVIEPGKGGTENEPVRLGSREGG--------VGD
          P                +   E+  + G G+ +  +       ++EGE       G K      E I+ G       PV      GG        VGD
Subjt:  WCPIH--------------VWCSEDERSGGGGFEIEGRGGDGGRDQLEGEVGVAREGGDKGGGGSDEVIEPGKGGTENEPVRLGSREGG--------VGD

Query:  AVQEGTEVVDKGKGLWVEGGGVE---RG---NEVRNGTDLIIK----DVSGEVERRVGEGAVGVGINVSEGGWEEVEEEGEGWGSGDGLRRGEPVGQEVS
          Q+G E +D+  G+  EGG VE   +G   +EV+ G    +K        E+   +GEGA+G  I ++                   L     VG++  
Subjt:  AVQEGTEVVDKGKGLWVEGGGVE---RG---NEVRNGTDLIIK----DVSGEVERRVGEGAVGVGINVSEGGWEEVEEEGEGWGSGDGLRRGEPVGQEVS

Query:  EYGGKKDEERERSEPLK-KKVESVGKSGGFCLFWKGGLDVVLRSMSRFHIDVMVRADEGH--WRFTGFYGDPELSNR-----------------------
          GGK++  R +    +  +V S G++GG  L+W+  + V + S S+ HI V V   + +  WR  G YG PE  ++                       
Subjt:  EYGGKKDEERERSEPLK-KKVESVGKSGGFCLFWKGGLDVVLRSMSRFHIDVMVRADEGH--WRFTGFYGDPELSNR-----------------------

Query:  -----------GGSVRNQRQMQAFRDAIDDCGLSDMGFRGHPFTWE-------IVEKR---------W----------NDPGHVHCLNQGSLM-------
                   GG  R +R M  FR+A+D  G+ D+GF+G  FTW+       +V +R         W          N P  V   + G ++       
Subjt:  -----------GGSVRNQRQMQAFRDAIDDCGLSDMGFRGHPFTWE-------IVEKR---------W----------NDPGHVHCLNQGSLM-------

Query:  --------------------------DAW--------KVRVKGCADVLSGWGRTRKGNYASKISLARQQLQ---------VAMSNQGDVQETL-------
                                  D W        + ++ GCA  LS W     G+   +   A ++L+         V +S    + E L       
Subjt:  --------------------------DAW--------KVRVKGCADVLSGWGRTRKGNYASKISLARQQLQ---------VAMSNQGDVQETL-------

Query:  -----DGSTKGHPRGEELMEFLAHIQPCVSESDNALLRKPFSEEELLCSLKQMGPTKAPGEDGLPALFYQKFWGIVGKE---------------------
               +      GEE ++ L+ +     E +  L   P S EE+  +L +M P KAPG DGL ALFYQKFW IVG +                     
Subjt:  -----DGSTKGHPRGEELMEFLAHIQPCVSESDNALLRKPFSEEELLCSLKQMGPTKAPGEDGLPALFYQKFWGIVGKE---------------------

Query:  ------------------------------------------------TQSAFVPARNVCDNVILGFECLHYIRQRKRGGEGWAALKLDMSKAYDRVEWF
                                                         QSAFVP R + DN ++ FE  HY+++R  G  G  ALKLDM KAYDRVEW 
Subjt:  ------------------------------------------------TQSAFVPARNVCDNVILGFECLHYIRQRKRGGEGWAALKLDMSKAYDRVEWF

Query:  FLEKLMLALGMDVKVVALIMRCVRTVSYSFLVNGKRVGCLTPSRGIRQGDPISPYLFLLCAEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFL
        FLEK+M  LG     V  +MRC+   S+    NG+  G +TP+RG+RQGDPISPYLFL  A+  SR LS+   EG + GVK+  G+P ISHLFFADD  L
Subjt:  FLEKLMLALGMDVKVVALIMRCVRTVSYSFLVNGKRVGCLTPSRGIRQGDPISPYLFLLCAEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFL

Query:  FFKAKQSEARWVSKSLSAYSILTGQEVNLAKSGVCFSPNVGVQAKQAIASSLGVQLVGFHDRYLGLPAV
        F +A  +E   ++  L+ +   +GQ++NL+KS V FS NV +  ++ I   LGV  V  H++YLG+P +
Subjt:  FFKAKQSEARWVSKSLSAYSILTGQEVNLAKSGVCFSPNVGVQAKQAIASSLGVQLVGFHDRYLGLPAV

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]4.4e-8625.77Show/hide
Query:  LKKWELTERETHAIEAVDNSRLFAAEEIDLCLAAKVLSSKYINADAFRNMMKSVWRVHQNTRIEPAGDNIFLIKFRSKLEKARVVKSGPWNFDKSLIVMV
        L K +LT  E   I   D+ RL A E  +L L  K L+ K  N  A +N ++  W ++ + +I   G N+F  KF+S+ +  R+++ GPW+FD  L+++ 
Subjt:  LKKWELTERETHAIEAVDNSRLFAAEEIDLCLAAKVLSSKYINADAFRNMMKSVWRVHQNTRIEPAGDNIFLIKFRSKLEKARVVKSGPWNFDKSLIVMV

Query:  SPRENDQVGELDFTKESFWIQVHNVPLNCMTVDMAKLLGEVVGVIEEIEWDGQNGWTGPIVRIRVVVDILKPLQRVVKIKTKDGKEVWCP-----IHVWC
          ++   VG + F   S WIQ+ + P + ++  +A+ +G  +G +EE+E   +       +R+RV + I KP++R   I   DG + W       + ++C
Subjt:  SPRENDQVGELDFTKESFWIQVHNVPLNCMTVDMAKLLGEVVGVIEEIEWDGQNGWTGPIVRIRVVVDILKPLQRVVKIKTKDGKEVWCP-----IHVWC

Query:  ------SEDERSGGGGFEIEGRG-------GD-----GGRDQ------------LEGEVGVAREGGDKGGGG----------------------------
                D R   G + +E +G       GD     GGR +             E     A E   K G                              
Subjt:  ------SEDERSGGGGFEIEGRG-------GD-----GGRDQ------------LEGEVGVAREGGDKGGGG----------------------------

Query:  -SDEVIEPGKGG--TENEPVRLGSR---EGGVGDAVQEGTEVVDKGK------GLWVEGGGVERGNEVRNGTDLIIKDVSGEVERRVGEGAVGVGINVS-
         +DE + PG     T N   ++ S     G    A   G +  DKG+      G+    G   + NE  N  D++I+     ++  +    +    NV  
Subjt:  -SDEVIEPGKGG--TENEPVRLGSR---EGGVGDAVQEGTEVVDKGK------GLWVEGGGVERGNEVRNGTDLIIKDVSGEVERRVGEGAVGVGINVS-

Query:  EGGWEEVEEEGEGWGSGDGLRRGEPVGQEVSEYGGKKDEERERSEPLKKKVESVGKSGGFCLFWKGGLDVVLRSMSRFHIDVMVRADEGH-WRFTGFYGD
        +G W  V     G G          +G+      G ++   E +E  K+  +     GG    WK  + + + + +  H+   V  ++G  W  TGFYG 
Subjt:  EGGWEEVEEEGEGWGSGDGLRRGEPVGQEVSEYGGKKDEERERSEPLKKKVESVGKSGGFCLFWKGGLDVVLRSMSRFHIDVMVRADEGH-WRFTGFYGD

Query:  PELSNRGGSVRNQR----------------------------------QMQAFRDAIDDCGLSDMGFRGHPFTWE----------------IVEKRWNDP
        P    +  S R  +                                  Q++AFR+A+  C L D+GF+G+P+TW                 +  K W D 
Subjt:  PELSNRGGSVRNQR----------------------------------QMQAFRDAIDDCGLSDMGFRGHPFTWE----------------IVEKRWNDP

Query:  G--------------------HVHCLNQ----------------------GSLMDAW-------------KVRVKGCADVLSGWG---------------
                             HV   +Q                        + +AW             + ++K C   L  WG               
Subjt:  G--------------------HVHCLNQ----------------------GSLMDAW-------------KVRVKGCADVLSGWG---------------

Query:  -------------------------------------------------------------RTRKGNYASKISLARQQLQVAMSNQGDVQETLDGSTKGH
                                                                     + R+ N+   I  ++ Q    +   G V      +    
Subjt:  -------------------------------------------------------------RTRKGNYASKISLARQQLQVAMSNQGDVQETLDGSTKGH

Query:  PRGEELMEFLAHIQPCVSESDNALLRKPFSEEELLCSLKQMGPTKAPGEDGLPALFYQKFWGIVG-----------------------------------
          G+++ E L  +   V+E     L   F+ EE+  +L QMGPTKAPG DG+ ALFYQKFW IVG                                   
Subjt:  PRGEELMEFLAHIQPCVSESDNALLRKPFSEEELLCSLKQMGPTKAPGEDGLPALFYQKFWGIVG-----------------------------------

Query:  ----------------------------------KETQSAFVPARNVCDNVILGFECLHYIRQRKRGGEGWAALKLDMSKAYDRVEWFFLEKLMLALGMD
                                            TQSAFVP R + DNV++ +E LH +  RK+G +G  ALKLD+SKAYDRVEW FL+ +M  +G  
Subjt:  ----------------------------------KETQSAFVPARNVCDNVILGFECLHYIRQRKRGGEGWAALKLDMSKAYDRVEWFFLEKLMLALGMD

Query:  VKVVALIMRCVRTVSYSFLVNGKRVGCLTPSRGIRQGDPISPYLFLLCAEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFLFFKAKQSEARWV
           +  +M CV T S+S LVNGK    + PSRGIRQGDPISPYLFLLCAEGL+ +L+  E  G ++GV I RG+P I++L FADD  LF +A ++E   +
Subjt:  VKVVALIMRCVRTVSYSFLVNGKRVGCLTPSRGIRQGDPISPYLFLLCAEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFLFFKAKQSEARWV

Query:  SKSLSAYSILTGQEVNLAKSGVCFSPNVGVQAKQAIASSLGVQLVGFHDRYLGLPAVFLGRKALSLKYVKDQ
        ++ L  Y   +GQ +NL KS   FS N     K  I   LGV+ V    +YLGLP +    K  +   +KD+
Subjt:  SKSLSAYSILTGQEVNLAKSGVCFSPNVGVQAKQAIASSLGVQLVGFHDRYLGLPAVFLGRKALSLKYVKDQ

XP_035544642.1 uncharacterized protein LOC109020982 [Juglans regia]1.9e-8927.32Show/hide
Query:  EELIMRLKKWELTERETHAIEAVDNSRLFAAEEIDLCLAAKVLSSKYINADAFRNMMKSVWRVHQNTRIEPAGDNIFLIKFRSKLEKARVVKSGPWNFDK
        EEL  + K ++LTE+E+  +    ++   A  +   CL   +++ K IN +AFRN M  VWR     +    G+N FLI+F    ++ RV+K  PW+FD+
Subjt:  EELIMRLKKWELTERETHAIEAVDNSRLFAAEEIDLCLAAKVLSSKYINADAFRNMMKSVWRVHQNTRIEPAGDNIFLIKFRSKLEKARVVKSGPWNFDK

Query:  SLIVMVSPRENDQVGELDFTKESFWIQVHNVPLNCMTVDMAKLLGEVVGVIEEIEWDGQN-GWTGPIVRIRVVVDILKPLQRVVKIKTKDGKEVWCP---
         L+ + +   N  V ++ FT+E FW+QVHN+PL  MT ++ K +G+ VG + ++  D +  GW G  +RIRV + I K L R + + T DG++ W     
Subjt:  SLIVMVSPRENDQVGELDFTKESFWIQVHNVPLNCMTVDMAKLLGEVVGVIEEIEWDGQN-GWTGPIVRIRVVVDILKPLQRVVKIKTKDGKEVWCP---

Query:  --IHVWCSEDERSGGGGFEIEGRGGDGGRDQL----------EGEVGVAREGGDKG---GGGSDEVIEPGKGGTENEPVRLGSREG-----GVGDAVQEG
          +  +C +       G   +       ++Q           EG++   R G D      G        GK G E    +  S+EG        D +QE 
Subjt:  --IHVWCSEDERSGGGGFEIEGRGGDGGRDQL----------EGEVGVAREGGDKG---GGGSDEVIEPGKGGTENEPVRLGSREG-----GVGDAVQEG

Query:  TEVV--------DKGKGLWVEGGGVERGN----EVRNGTDL-IIKDVSGEVERRVGEGAVGVGINVSEGGWEEVEEEGE-----------------GWGS
           +        D  + L      + + N    ++   T L  +K+          +     G +  +   +  E   +                  W  
Subjt:  TEVV--------DKGKGLWVEGGGVERGN----EVRNGTDL-IIKDVSGEVERRVGEGAVGVGINVSEGGWEEVEEEGE-----------------GWGS

Query:  GDGLRRGEPVGQEVSEYGG-------------------KKDEERERSEPLKK--------KVESVGKSGGFCLFWKGGLDVVLRSMSRFHIDVMVRA--D
           L + EP+G+E  E                      +   +R + E +K+         ++  G SGG    WK  ++  + S S+ HI +MV+   +
Subjt:  GDGLRRGEPVGQEVSEYGG-------------------KKDEERERSEPLKK--------KVESVGKSGGFCLFWKGGLDVVLRSMSRFHIDVMVRA--D

Query:  EGHWRFTGFYGDPELSNR----------------------------------GGSVRNQRQMQAFRDAIDDCGLSDMGFRGHPFTW-----------EIV
           W  TGFYG P  + R                                  GG+VR   Q+++FRD ++ CGLSD+GF G+ FTW           E++
Subjt:  EGHWRFTGFYGDPELSNR----------------------------------GGSVRNQRQMQAFRDAIDDCGLSDMGFRGHPFTW-----------EIV

Query:  EK-----RWND--PGHV----------HC---------------------------LN---QGSLMDAW-KVR------------VKGCADVLSGWGRTR
        ++      W D  P H           HC                           LN      L + W K R            ++ C D L  W R R
Subjt:  EK-----RWND--PGHV----------HC---------------------------LN---QGSLMDAW-KVR------------VKGCADVLSGWGRTR

Query:  KGNYASKISLARQQL-QVAMSNQG-------DVQETLDG--------STKGHPRGEELMEFLAHIQPCVSESDNALLRKPFSEEELLCSLKQMGPTKAPG
        +GN+   +     QL  + +SN G        +Q+ LDG         T   P   +L   L ++ P VS   N++L KP+S +E+  +L QM    +PG
Subjt:  KGNYASKISLARQQL-QVAMSNQG-------DVQETLDG--------STKGHPRGEELMEFLAHIQPCVSESDNALLRKPFSEEELLCSLKQMGPTKAPG

Query:  EDGLPALFYQKFWGIVGKET---------------------------------------------------------------------QSAFVPARNVC
         DG PA FYQ+ W  VG +                                                                      QSAF+P R + 
Subjt:  EDGLPALFYQKFWGIVGKET---------------------------------------------------------------------QSAFVPARNVC

Query:  DNVILGFECLHYIRQRKRGGEGWAALKLDMSKAYDRVEWFFLEKLMLALGMDVKVVALIMRCVRTVSYSFLVNGKRVGCLTPSRGIRQGDPISPYLFLLC
        DNVI+ FE LH ++ + +G EG+ ALKLDMSKAYDR+EW FL  ++  +G +   + L+M CV TVSY+ LVNG       PSRGIRQGDP+SPY F+LC
Subjt:  DNVILGFECLHYIRQRKRGGEGWAALKLDMSKAYDRVEWFFLEKLMLALGMDVKVVALIMRCVRTVSYSFLVNGKRVGCLTPSRGIRQGDPISPYLFLLC

Query:  AEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFLFFKAKQSEARWVSKSLSAYSILTGQEVNLAKSGVCFSPNVGVQAKQAIASSLGVQLVGFH
        AE LS ++   E  G + GV + RG   +SHLFFADD  +F KA   E   +   L  Y   +GQ +N+ K+ + FS N     +  I    G++    +
Subjt:  AEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFLFFKAKQSEARWVSKSLSAYSILTGQEVNLAKSGVCFSPNVGVQAKQAIASSLGVQLVGFH

Query:  DRYLGLPAVFLGRKALSLKYVKDQFGLMFRS
        + YLGLP+V    K  + + + D+  +   S
Subjt:  DRYLGLPAVFLGRKALSLKYVKDQFGLMFRS

TrEMBL top hitse value%identityAlignment
A0A1R3GTB5 Reverse transcriptase4.6e-9730.32Show/hide
Query:  KKWELTERETHAIEAVDNSRLF--AAEEIDLCLAAKVLSSKYINADAFRNMMKSVWRVHQNTRIEPAGDNIFLIKFRSKLEKARVVKSGPWNFDKSLIVM
        + + LTE E   +E   + RL      E + CL  K+LS + +N +  RN+M  VW++    ++   G+N+F+ +F S LEK RV +  PWNF+K+L+V+
Subjt:  KKWELTERETHAIEAVDNSRLF--AAEEIDLCLAAKVLSSKYINADAFRNMMKSVWRVHQNTRIEPAGDNIFLIKFRSKLEKARVVKSGPWNFDKSLIVM

Query:  VSPRENDQVGELDFTKESFWIQVHNVPLNCMTVDMAKLLGEVVGVIEEIEWDGQNGWTGPIVRIRVVVDILKPLQRVVKIKTKDGKEV------------
         S    D V ++     SFW Q H++PL  M   + +++GE  G +EEI+  G     G  +R R  +++ KPL+R + +   +G ++            
Subjt:  VSPRENDQVGELDFTKESFWIQVHNVPLNCMTVDMAKLLGEVVGVIEEIEWDGQNGWTGPIVRIRVVVDILKPLQRVVKIKTKDGKEV------------

Query:  ----WCPIHV-----------------------WC-----------------------SEDERSGGGGFEIEGRGGDGGRDQLEGEVGVAREGGDKGGGG
             C  HV                       W                         E+E+        EG G  G + Q   ++  AR+  D     
Subjt:  ----WCPIHV-----------------------WC-----------------------SEDERSGGGGFEIEGRGGDGGRDQLEGEVGVAREGGDKGGGG

Query:  SDEVIEPGKGGTENEPVRLGSREGGVGDAVQEGTE-------VVDKGKGLWVEGGGVERGNEVRNGTDLIIKDVSGEVERRVGEGAVGVGINVSEGGWEE
         D     G   T  E V   + +  VGD+ Q+  E        V   K ++ +G    +   + +G  L  K     V + V E +   G +  +  W+ 
Subjt:  SDEVIEPGKGGTENEPVRLGSREGGVGDAVQEGTE-------VVDKGKGLWVEGGGVERGNEVRNGTDLIIKDVSGEVERRVGEGAVGVGINVSEGGWEE

Query:  VEEEGEGWGSGDGLRRGEPVGQEVSEYGGKKDEERERSEPLKKKVESVGKSGGFCLFWKGGLDVVLRSMSRFHIDVMVRADEGH--WRFTGFYGDPELSN
                  G   +  +  GQ+     G   E R++S+    +V+S  +SGG  L WK   +V + S S  H D +V   +G   WRFTGFYG+P    
Subjt:  VEEEGEGWGSGDGLRRGEPVGQEVSEYGGKKDEERERSEPLKKKVESVGKSGGFCLFWKGGLDVVLRSMSRFHIDVMVRADEGH--WRFTGFYGDPELSN

Query:  R----------------------------------GGSVRNQRQMQAFRDAIDDCGLSDMGFRGHPFTWEIVEKRWNDPGHVHCLNQGSLMDAWKVRVKG
        R                                  GGS+R   Q+Q FR+ I DC L  +   G   TW    +   +      L++  +  +W+ R   
Subjt:  R----------------------------------GGSVRNQRQMQAFRDAIDDCGLSDMGFRGHPFTWEIVEKRWNDPGHVHCLNQGSLMDAWKVRVKG

Query:  CADVLSGWGRTRKGNYASKISLARQQLQVAMSN--QGDVQETLDGSTKGHPRGEELMEFLAHIQPCVSESDNALLRKPFSEEELLCSLKQMGPTKAPGED
                   +K N   KI  AR +  V   +  Q  V       T  H R  +   F  +I+  +S     +L K +   E+  ++ QM P+KA G D
Subjt:  CADVLSGWGRTRKGNYASKISLARQQLQVAMSN--QGDVQETLDGSTKGHPRGEELMEFLAHIQPCVSESDNALLRKPFSEEELLCSLKQMGPTKAPGED

Query:  GLPALFYQKFWGIVG--------------------KETQSAFVPARNVCDNVILGFECLHYIRQRKRGGEGWAALKLDMSKAYDRVEWFFLEKLMLALGM
        G+ A+FYQK+W ++G                     E QSAFVP R + DN ++ +E +HY++ +K G  G  ALKLDMSKAYDRVEW +LE++M  +G 
Subjt:  GLPALFYQKFWGIVG--------------------KETQSAFVPARNVCDNVILGFECLHYIRQRKRGGEGWAALKLDMSKAYDRVEWFFLEKLMLALGM

Query:  DVKVVALIMRCVRTVSYSFLVNGKRVGCLTPSRGIRQGDPISPYLFLLCAEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFLFFKAKQSEARW
            ++LIM  VRTVSYS +VNGK+     PSRGIRQGDPISPYLFLLC EGLS +L+  +++G + GV + R +P I+HL FADD  LF +A  +EA  
Subjt:  DVKVVALIMRCVRTVSYSFLVNGKRVGCLTPSRGIRQGDPISPYLFLLCAEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFLFFKAKQSEARW

Query:  VSKSLSAYSILTGQEVNLAKSGVCFSPNVGVQAKQAIASSLGVQLVGFHDRYLGLPAVFLGRKALSLKYVKDQ
        +   L  Y   +GQ VN+ KS V FS N  +  ++AI   LGV      D+YLG+P +    K    +++KD+
Subjt:  VSKSLSAYSILTGQEVNLAKSGVCFSPNVGVQAKQAIASSLGVQLVGFHDRYLGLPAVFLGRKALSLKYVKDQ

A0A2N9J7Z2 Reverse transcriptase domain-containing protein1.7e-9126.47Show/hide
Query:  LAAKVLSSKYINADAFRNMMKSVWRVHQNTRIEPAGDNIFLIKFRSKLEKARVVKSGPWNFDKSLIVMVSPRENDQVGELDFTKESFWIQVHNVPLNCMT
        LAAK  + + IN +A     K +W+   +  ++  GDNI LI+F  + +  RV+   PW++DK LI     RE   V EL F K  FW+Q+HN+P+ CM 
Subjt:  LAAKVLSSKYINADAFRNMMKSVWRVHQNTRIEPAGDNIFLIKFRSKLEKARVVKSGPWNFDKSLIVMVSPRENDQVGELDFTKESFWIQVHNVPLNCMT

Query:  VDMAKLLGEVVGVIEEIEWDGQNGWTGPIVRIRVVVDILKPLQRVVKIKTKDGKEVWCPIH--------VWCSEDERSGGGGFEIEGRGGDGGRDQLEGE
          +A+ LG  +G +   +   +   +G  +RIRV VDI +PL R  +I   +G E W             WC                            
Subjt:  VDMAKLLGEVVGVIEEIEWDGQNGWTGPIVRIRVVVDILKPLQRVVKIKTKDGKEVWCPIH--------VWCSEDERSGGGGFEIEGRGGDGGRDQLEGE

Query:  VGVAREGGDKGGGGSDEVIEPGKGGTENEPVRLGSREGGVGDAVQEGTEVVDKGKGLWVEGGGVER------GNEVRNGTDLIIKDVSGEVERRVGEGAV
         G+   G                                            +K    W++ G  E+      G  +R   +  I+ V  +VE        
Subjt:  VGVAREGGDKGGGGSDEVIEPGKGGTENEPVRLGSREGGVGDAVQEGTEVVDKGKGLWVEGGGVER------GNEVRNGTDLIIKDVSGEVERRVGEGAV

Query:  GVGINVSEGGWEEVEEEGEGWGSGDGL------------------RRGEPVGQEVSEYGGKKDEERERSEPLKKKVESVGKSGGFCLFWKGGLDVVLRSM
                 GW      G G G+G G                   R+ +P   E +EY G+  E            +   +     + +  G+ +   S+
Subjt:  GVGINVSEGGWEEVEEEGEGWGSGDGL------------------RRGEPVGQEVSEYGGKKDEERERSEPLKKKVESVGKSGGFCLFWKGGLDVVLRSM

Query:  -SRFHIDVMVRADEGH-WRFTGFYGDPELSNR----------------------------------GGSVRNQRQMQAFRDAIDDCGLSDMGFRGHPFTW
         +  HID ++R      WR TG YG  E   R                                  G   R  RQM+ FR A+D+ GL D+ FRG PFTW
Subjt:  -SRFHIDVMVRADEGH-WRFTGFYGDPELSNR----------------------------------GGSVRNQRQMQAFRDAIDDCGLSDMGFRGHPFTW

Query:  ---------------------------------------------------------------------------EIVEKRWNDPGHVHCLNQGSLMDAW
                                                                                   EI+ K W+         +G+ M + 
Subjt:  ---------------------------------------------------------------------------EIVEKRWNDPGHVHCLNQGSLMDAW

Query:  KVRVKGCADVLSGWGRTRKGNYASKISLARQQLQVA-------MSNQGDVQETL-----DGSTKGHPRGEE--------------------LMEFLAHIQ
          +++ C   LS W R+  G+   KI   + ++           S   DV+E L     D  +  H   EE                    L+E ++ I 
Subjt:  KVRVKGCADVLSGWGRTRKGNYASKISLARQQLQVA-------MSNQGDVQETL-----DGSTKGHPRGEE--------------------LMEFLAHIQ

Query:  PCVSESDNALLRKPFSEEELLCSLKQMGPTKAPGEDGLPALFYQKFWGIVG-------------------------------------------------
          V+E+ N +L + F  +E+  ++KQM P+KAPG DG+P +FYQK+W ++G                                                 
Subjt:  PCVSESDNALLRKPFSEEELLCSLKQMGPTKAPGEDGLPALFYQKFWGIVG-------------------------------------------------

Query:  --------------------KETQSAFVPARNVCDNVILGFECLHYIRQRKRGGEGWAALKLDMSKAYDRVEWFFLEKLMLALGMDVKVVALIMRCVRTV
                             E+QSAFVP R + DNV++ FE LH++   K G EG  ALKLDMSKAYDRVEW +L ++M  +G   K + L+  C+ TV
Subjt:  --------------------KETQSAFVPARNVCDNVILGFECLHYIRQRKRGGEGWAALKLDMSKAYDRVEWFFLEKLMLALGMDVKVVALIMRCVRTV

Query:  SYSFLVNGKRVGCLTPSRGIRQGDPISPYLFLLCAEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFLFFKAKQSEARWVSKSLSAYSILTGQE
        SYS L+NG+  G + PSRG+RQGDP+SPYLFLLCAEGL  ++   ES G + GV + RG P I+HLFFADD  LF KA  +    +   L  Y   +GQ+
Subjt:  SYSFLVNGKRVGCLTPSRGIRQGDPISPYLFLLCAEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFLFFKAKQSEARWVSKSLSAYSILTGQE

Query:  VNLAKSGVCFSPNVGVQAKQAIASSLGVQLVGFHDRYLGLPAVFLGRKALSLKYVKDQ
        VN  K+ + FS  V    + AI  +L V ++  +++YLGLP++    +A S   +K++
Subjt:  VNLAKSGVCFSPNVGVQAKQAIASSLGVQLVGFHDRYLGLPAVFLGRKALSLKYVKDQ

A0A6P9EQ08 uncharacterized protein LOC1090209829.3e-9027.32Show/hide
Query:  EELIMRLKKWELTERETHAIEAVDNSRLFAAEEIDLCLAAKVLSSKYINADAFRNMMKSVWRVHQNTRIEPAGDNIFLIKFRSKLEKARVVKSGPWNFDK
        EEL  + K ++LTE+E+  +    ++   A  +   CL   +++ K IN +AFRN M  VWR     +    G+N FLI+F    ++ RV+K  PW+FD+
Subjt:  EELIMRLKKWELTERETHAIEAVDNSRLFAAEEIDLCLAAKVLSSKYINADAFRNMMKSVWRVHQNTRIEPAGDNIFLIKFRSKLEKARVVKSGPWNFDK

Query:  SLIVMVSPRENDQVGELDFTKESFWIQVHNVPLNCMTVDMAKLLGEVVGVIEEIEWDGQN-GWTGPIVRIRVVVDILKPLQRVVKIKTKDGKEVWCP---
         L+ + +   N  V ++ FT+E FW+QVHN+PL  MT ++ K +G+ VG + ++  D +  GW G  +RIRV + I K L R + + T DG++ W     
Subjt:  SLIVMVSPRENDQVGELDFTKESFWIQVHNVPLNCMTVDMAKLLGEVVGVIEEIEWDGQN-GWTGPIVRIRVVVDILKPLQRVVKIKTKDGKEVWCP---

Query:  --IHVWCSEDERSGGGGFEIEGRGGDGGRDQL----------EGEVGVAREGGDKG---GGGSDEVIEPGKGGTENEPVRLGSREG-----GVGDAVQEG
          +  +C +       G   +       ++Q           EG++   R G D      G        GK G E    +  S+EG        D +QE 
Subjt:  --IHVWCSEDERSGGGGFEIEGRGGDGGRDQL----------EGEVGVAREGGDKG---GGGSDEVIEPGKGGTENEPVRLGSREG-----GVGDAVQEG

Query:  TEVV--------DKGKGLWVEGGGVERGN----EVRNGTDL-IIKDVSGEVERRVGEGAVGVGINVSEGGWEEVEEEGE-----------------GWGS
           +        D  + L      + + N    ++   T L  +K+          +     G +  +   +  E   +                  W  
Subjt:  TEVV--------DKGKGLWVEGGGVERGN----EVRNGTDL-IIKDVSGEVERRVGEGAVGVGINVSEGGWEEVEEEGE-----------------GWGS

Query:  GDGLRRGEPVGQEVSEYGG-------------------KKDEERERSEPLKK--------KVESVGKSGGFCLFWKGGLDVVLRSMSRFHIDVMVRA--D
           L + EP+G+E  E                      +   +R + E +K+         ++  G SGG    WK  ++  + S S+ HI +MV+   +
Subjt:  GDGLRRGEPVGQEVSEYGG-------------------KKDEERERSEPLKK--------KVESVGKSGGFCLFWKGGLDVVLRSMSRFHIDVMVRA--D

Query:  EGHWRFTGFYGDPELSNR----------------------------------GGSVRNQRQMQAFRDAIDDCGLSDMGFRGHPFTW-----------EIV
           W  TGFYG P  + R                                  GG+VR   Q+++FRD ++ CGLSD+GF G+ FTW           E++
Subjt:  EGHWRFTGFYGDPELSNR----------------------------------GGSVRNQRQMQAFRDAIDDCGLSDMGFRGHPFTW-----------EIV

Query:  EK-----RWND--PGHV----------HC---------------------------LN---QGSLMDAW-KVR------------VKGCADVLSGWGRTR
        ++      W D  P H           HC                           LN      L + W K R            ++ C D L  W R R
Subjt:  EK-----RWND--PGHV----------HC---------------------------LN---QGSLMDAW-KVR------------VKGCADVLSGWGRTR

Query:  KGNYASKISLARQQL-QVAMSNQG-------DVQETLDG--------STKGHPRGEELMEFLAHIQPCVSESDNALLRKPFSEEELLCSLKQMGPTKAPG
        +GN+   +     QL  + +SN G        +Q+ LDG         T   P   +L   L ++ P VS   N++L KP+S +E+  +L QM    +PG
Subjt:  KGNYASKISLARQQL-QVAMSNQG-------DVQETLDG--------STKGHPRGEELMEFLAHIQPCVSESDNALLRKPFSEEELLCSLKQMGPTKAPG

Query:  EDGLPALFYQKFWGIVGKET---------------------------------------------------------------------QSAFVPARNVC
         DG PA FYQ+ W  VG +                                                                      QSAF+P R + 
Subjt:  EDGLPALFYQKFWGIVGKET---------------------------------------------------------------------QSAFVPARNVC

Query:  DNVILGFECLHYIRQRKRGGEGWAALKLDMSKAYDRVEWFFLEKLMLALGMDVKVVALIMRCVRTVSYSFLVNGKRVGCLTPSRGIRQGDPISPYLFLLC
        DNVI+ FE LH ++ + +G EG+ ALKLDMSKAYDR+EW FL  ++  +G +   + L+M CV TVSY+ LVNG       PSRGIRQGDP+SPY F+LC
Subjt:  DNVILGFECLHYIRQRKRGGEGWAALKLDMSKAYDRVEWFFLEKLMLALGMDVKVVALIMRCVRTVSYSFLVNGKRVGCLTPSRGIRQGDPISPYLFLLC

Query:  AEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFLFFKAKQSEARWVSKSLSAYSILTGQEVNLAKSGVCFSPNVGVQAKQAIASSLGVQLVGFH
        AE LS ++   E  G + GV + RG   +SHLFFADD  +F KA   E   +   L  Y   +GQ +N+ K+ + FS N     +  I    G++    +
Subjt:  AEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFLFFKAKQSEARWVSKSLSAYSILTGQEVNLAKSGVCFSPNVGVQAKQAIASSLGVQLVGFH

Query:  DRYLGLPAVFLGRKALSLKYVKDQFGLMFRS
        + YLGLP+V    K  + + + D+  +   S
Subjt:  DRYLGLPAVFLGRKALSLKYVKDQFGLMFRS

A0A7N2R0C3 Reverse transcriptase domain-containing protein3.8e-9125.83Show/hide
Query:  EELIMRLKKWELTERETHAIEAVDNSRLFAAEEIDLCLAAKVLSSKYINADAFRNMMKSVWRVHQNTRIEPAGDNIFLIKFRSKLEKARVVKSGPWNFDK
        EEL +  ++ ++TE E  +I   D     A E    C+  KV+S K +  +A R  ++ +W+ +++ ++   G+ +FL++F  + +K RV+   PW+++K
Subjt:  EELIMRLKKWELTERETHAIEAVDNSRLFAAEEIDLCLAAKVLSSKYINADAFRNMMKSVWRVHQNTRIEPAGDNIFLIKFRSKLEKARVVKSGPWNFDK

Query:  SLIVMVSPRENDQVGELDFTKESFWIQVHNVPLNCMTVDMAKLLGEVVGVIEEIEWDGQNGWTGPIVRIRVVVDILKPLQRVVKI---------------
         L++      ++   ++      FW+Q++N+PL   T +  K +GE +G   E++ +      G  +R+RV +D+ + L R  KI               
Subjt:  SLIVMVSPRENDQVGELDFTKESFWIQVHNVPLNCMTVDMAKLLGEVVGVIEEIEWDGQNGWTGPIVRIRVVVDILKPLQRVVKI---------------

Query:  ------------------------KTKDGKEVWCPIHVWC-SEDERSGGGGFEIEGRGGDG--------------GRDQLEGEVGVARE-----------
                                K K G+E       W   E  R GG  F    +   G              GRD++  + GVARE           
Subjt:  ------------------------KTKDGKEVWCPIHVWC-SEDERSGGGGFEIEGRGGDG--------------GRDQLEGEVGVARE-----------

Query:  -----GGDKGGGGSDEVIEPGKGGT--------ENEPVRLGSREGGVGDAVQEGTEVVDKGK----GLWVEGGGVERGNEVRNGTDLII---------KD
              GD  GGG    ++ G+ G           E V +G     VG+    G E   KG+       +       GNE    TD ++         KD
Subjt:  -----GGDKGGGGSDEVIEPGKGGT--------ENEPVRLGSREGGVGDAVQEGTEVVDKGK----GLWVEGGGVERGNEVRNGTDLII---------KD

Query:  VSGEVERRVGEGAVGVGINVSEGGWEEVEEEG-------------------------------------------------EGWGSG-------DGLRRG
            ++    EG V   +  S G W+ +   G                                                  G GS        D ++ G
Subjt:  VSGEVERRVGEGAVGVGINVSEGGWEEVEEEG-------------------------------------------------EGWGSG-------DGLRRG

Query:  EPVGQEVSEYGGKKDEERERSEPLKKK--------VESVGKSGGFCLFWKGGLDVVLRSMSRFHIDVMVRADEG--HWRFTGFYGDPELSNR--------
        +PV   ++E        + R + L++K        V S G+SGG  + W+ G+DV L+S S  HIDV+V    G   WR TGFYG P+   R        
Subjt:  EPVGQEVSEYGGKKDEERERSEPLKKK--------VESVGKSGGFCLFWKGGLDVVLRSMSRFHIDVMVRADEG--HWRFTGFYGDPELSNR--------

Query:  --------------------------GGSVRNQRQMQAFRDAIDDCGLSDMGFRGHPFTW----------------EIVEKRWND------------PGH
                                  G   R+ RQM+ FR+ + +CGL D+GF G  FTW                 +  + W +               
Subjt:  --------------------------GGSVRNQRQMQAFRDAIDDCGLSDMGFRGHPFTW----------------EIVEKRWND------------PGH

Query:  VHCLNQGSLM---------------DAWKVRVKGCADV--------------------------LSGWGRTRKGNY-----------------------A
         HCL   S+                + W  R +GC +V                          L  W R   GN                        A
Subjt:  VHCLNQGSLM---------------DAWKVRVKGCADV--------------------------LSGWGRTRKGNY-----------------------A

Query:  SKISLARQQLQVAMSNQ---------------GD------------------VQETLDGSTKGHPRGEELMEF--------------------LAHIQPC
         ++   ++++   M  +               GD                  ++  LD   +     EE+ E                     L  +   
Subjt:  SKISLARQQLQVAMSNQ---------------GD------------------VQETLDGSTKGHPRGEELMEF--------------------LAHIQPC

Query:  VSESDNALLRKPFSEEELLCSLKQMGPTKAPGEDGLPALFYQKFWGIVG---------------------------------------------------
        V+E  N  L + F EEE+  +L QM PTK+PG DG+  +F+QK+W +VG                                                   
Subjt:  VSESDNALLRKPFSEEELLCSLKQMGPTKAPGEDGLPALFYQKFWGIVG---------------------------------------------------

Query:  ------------------KETQSAFVPARNVCDNVILGFECLHYIRQRKRGGEGWAALKLDMSKAYDRVEWFFLEKLMLALGMDVKVVALIMRCVRTVSY
                           E QSAFVP R + DNV++ FE +H I QR++G EG  A+KLDMSKAYDRVEW +LE +M  +G   + ++L+M CV TVS+
Subjt:  ------------------KETQSAFVPARNVCDNVILGFECLHYIRQRKRGGEGWAALKLDMSKAYDRVEWFFLEKLMLALGMDVKVVALIMRCVRTVSY

Query:  SFLVNGKRVGCLTPSRGIRQGDPISPYLFLLCAEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFLFFKAKQSEARWVSKSLSAYSILTGQEVN
        S L+NG+  G + P+RG+RQGDPISPYLFLLCAEGLS ML   E    VSGV+I R +P ISHL FADDC +F KA   E   V+K L  Y   +GQ++N
Subjt:  SFLVNGKRVGCLTPSRGIRQGDPISPYLFLLCAEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFLFFKAKQSEARWVSKSLSAYSILTGQEVN

Query:  LAKSGVCFSPNVGVQAKQAIASSLGVQLVGFHDRYLGLPAVFLGRKALSLKYVKDQFG
          K+ + FS N  V+ K+ +    G +++  H+RYLGLP +    K  +   +KDQ G
Subjt:  LAKSGVCFSPNVGVQAKQAIASSLGVQLVGFHDRYLGLPAVFLGRKALSLKYVKDQFG

M5VU98 Reverse transcriptase domain-containing protein8.4e-9127.2Show/hide
Query:  MEELIMRL-KKWELTERETHAIEAVDNSRLFAAEEIDLCLAAKVLSSKYINADAFRNMMKSVWRVHQNTRIEPAGDNIFLIKFRSKLEKARVVKSGPWNF
        ME ++     ++ LTE E  A+  V+  +    +     L  KVLS + IN +AF+  M  +WR      I     ++F+  F++   +A +++ GPW F
Subjt:  MEELIMRL-KKWELTERETHAIEAVDNSRLFAAEEIDLCLAAKVLSSKYINADAFRNMMKSVWRVHQNTRIEPAGDNIFLIKFRSKLEKARVVKSGPWNF

Query:  DKSLIVMVSPRENDQVGELDFTKESFWIQVHNVPLNCMTVDMAKLLGEVVGVIEEIEWDGQNGWTGPIVRIRVVVDILKPLQRVVKIKTKDGKEVW----
        +  L+V+    +      +   ++ FW+QV  +PL  MT +M K++G+ +G     +   +    G  +RIRVV+DI KPL+R + I+ ++GK  W    
Subjt:  DKSLIVMVSPRENDQVGELDFTKESFWIQVHNVPLNCMTVDMAKLLGEVVGVIEEIEWDGQNGWTGPIVRIRVVVDILKPLQRVVKIKTKDGKEVW----

Query:  ------------CPIHV---------------------WCSED------ERSGGGGFEIEGRGGDGGR------DQLEGEVGVAREGGDKGGGGSD----
                    C  H+                     W  ED       R  G  F I    G   +      +++E  V VA + G  GG G D    
Subjt:  ------------CPIHV---------------------WCSED------ERSGGGGFEIEGRGGDGGR------DQLEGEVGVAREGGDKGGGGSD----

Query:  --EVIEPGKGGTENEPVRLGSREG-------------GVGDAVQEGTE-VVDKGKGLWVEGGGVERGNEVRNGTDL--IIKDVSGEVERRVGEGAVGVGI
          EV++P           L   E               +G   +   E + D  + L +            N TDL  II DV         E   G+G+
Subjt:  --EVIEPGKGGTENEPVRLGSREG-------------GVGDAVQEGTE-VVDKGKGLWVEGGGVERGNEVRNGTDL--IIKDVSGEVERRVGEGAVGVGI

Query:  NVSEG-GWEEVEEEGEGWGSGDGLRRG-----------------EPVGQEVSEYGGKKDEERER-----SEPLKKKVESV----GKSGGFCLFWKGGLDV
            G G +    EG G  S    R+G                  P+ ++V +    +    ER     S   K +  S     G SGG  L WK  +DV
Subjt:  NVSEG-GWEEVEEEGEGWGSGDGLRRG-----------------EPVGQEVSEYGGKKDEERER-----SEPLKKKVESV----GKSGGFCLFWKGGLDV

Query:  VLRSMSRFHIDVMVRADEG--HWRFTGFYGDPELSNR----------------------------------GGSVRNQRQMQAFRDAIDDCGLSDMGFRG
         + + S   IDV + ++ G   WR T FYG P + +R                                  GG +RN RQMQ FR+ +D  G  D+GF G
Subjt:  VLRSMSRFHIDVMVRADEG--HWRFTGFYGDPELSNR----------------------------------GGSVRNQRQMQAFRDAIDDCGLSDMGFRG

Query:  HPFTWE---------------IVEKRWND--PG-----------------------------------------HVHCLNQGSLMDAWK-----------
        + FTW+               +    W +  PG                                         HV C  + ++   W+           
Subjt:  HPFTWE---------------IVEKRWND--PG-----------------------------------------HVHCLNQGSLMDAWK-----------

Query:  -VRVKGCADVLSGWGRTRKGNYASKISLARQQL----QVAMSNQGD-----VQETLD-------------------------------------------
          ++K    VL  W ++  G+   +  + R +L    Q   S + +     VQ++LD                                           
Subjt:  -VRVKGCADVLSGWGRTRKGNYASKISLARQQL----QVAMSNQGD-----VQETLD-------------------------------------------

Query:  ----------GSTKG-------------HPRGEELM-EFLAHIQPCVSESDNALLRKPFSEEELLCSLKQMGPTKAPGEDGLPALFYQKFWGIVG-----
                   S +G                G  +M E L+ ++P V+     +L   FS +E+  ++ QM P+KAPG DGLP LFYQK+W IVG     
Subjt:  ----------GSTKG-------------HPRGEELM-EFLAHIQPCVSESDNALLRKPFSEEELLCSLKQMGPTKAPGEDGLPALFYQKFWGIVG-----

Query:  ----------------------------------------------------------------KETQSAFVPARNVCDNVILGFECLHYIRQRKRGGEG
                                                                         E+QSAFVP R + DN I+ FE  H+++QR+RG +G
Subjt:  ----------------------------------------------------------------KETQSAFVPARNVCDNVILGFECLHYIRQRKRGGEG

Query:  WAALKLDMSKAYDRVEWFFLEKLMLALGMDVKVVALIMRCVRTVSYSFLVNGKRVGCLTPSRGIRQGDPISPYLFLLCAEGLSRMLSWLESEGKVSGVKI
          ALKLDMSKAYDRVEW FLEK+MLA+G  +  V ++M CV TVSYSFLVNG+    L P+RG+RQGDP+SPYLFLLCAEG + +LS  E +G++ G+ I
Subjt:  WAALKLDMSKAYDRVEWFFLEKLMLALGMDVKVVALIMRCVRTVSYSFLVNGKRVGCLTPSRGIRQGDPISPYLFLLCAEGLSRMLSWLESEGKVSGVKI

Query:  ARGSPSISHLFFADDCFLFFKAKQSEARWVSKSLSAYSILTGQEVNLAKSGVCFSPNVGVQAKQAIASSLGVQLVGFHDRYLGLPAVFLGRKALSLKYVK
         RG+P++SHLFFADD F+F KA  +    +      Y   +GQ++N  KS V FS N+ +  +  +AS LGV  V  H  YLGLP +    K +  +Y+K
Subjt:  ARGSPSISHLFFADDCFLFFKAKQSEARWVSKSLSAYSILTGQEVNLAKSGVCFSPNVGVQAKQAIASSLGVQLVGFHDRYLGLPAVFLGRKALSLKYVK

Query:  DQ
        ++
Subjt:  DQ

SwissProt top hitse value%identityAlignment
P11369 LINE-1 retrotransposable element ORF2 protein2.1e-1424.79Show/hide
Query:  QSAFVPARNVCDNVILGFECLHYIRQRKRGGEGWAALKLDMSKAYDRVEWFFLEKLMLALGMDVKVVALIMRCVRTVSYSFLVNGKRVGCLTPSRGIRQG
        Q  F+P      N+      +HYI + K   +    + LD  KA+D+++  F+ K++   G+    + +I         +  VNG+++  +    G RQG
Subjt:  QSAFVPARNVCDNVILGFECLHYIRQRKRGGEGWAALKLDMSKAYDRVEWFFLEKLMLALGMDVKVVALIMRCVRTVSYSFLVNGKRVGCLTPSRGIRQG

Query:  DPISPYLFLLCAEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFLFFKAKQSEARWVSKSLSAYSILTGQEVNLAKSGVCFSPNVGVQAKQAIA
         P+SPYLF +  E L+R    +  + ++ G++I +    IS L  ADD  ++    ++  R +   ++++  + G ++N  KS + F      QA++ I 
Subjt:  DPISPYLFLLCAEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFLFFKAKQSEARWVSKSLSAYSILTGQEVNLAKSGVCFSPNVGVQAKQAIA

Query:  SSLGVQLVGFHDRYLGLPAVFLGRKALSLKYVKDQFGLMFRS
         +    +V  + +YLG+            K VKD +   F+S
Subjt:  SSLGVQLVGFHDRYLGLPAVFLGRKALSLKYVKDQFGLMFRS

P14381 Transposon TX1 uncharacterized 149 kDa protein7.0e-1023.34Show/hide
Query:  EELMEFLAHIQPCVSESDNALLRKPFSEEELLCSLKQMGPTKAPGEDGLPALFYQKFWGIVGKE------------------------------------
        EEL + L    P VSE     L  P + +EL  +L+ M   K+PG DGL   F+Q FW  +G +                                    
Subjt:  EELMEFLAHIQPCVSESDNALLRKPFSEEELLCSLKQMGPTKAPGEDGLPALFYQKFWGIVGKE------------------------------------

Query:  ---------------------------------TQSAFVPARNVCDNVILGFECLHYIRQRKRGGEGWAALKLDMSKAYDRVEWFFLEKLMLALGMDVKV
                                          QS  VP R + DNV L  + LH+ R   R G   A L LD  KA+DRV+  +L   + A     + 
Subjt:  ---------------------------------TQSAFVPARNVCDNVILGFECLHYIRQRKRGGEGWAALKLDMSKAYDRVEWFFLEKLMLALGMDVKV

Query:  VALIMRCVRTVSYSFLVNGKRVGCLTPSRGIRQGDPISPYLFLLCAEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFLFFKAKQSEARWVSKS
        V  +     +      +N      L   RG+RQG P+S  L+ L  E    +L       +++G+ +      +    +ADD  L  +      R   + 
Subjt:  VALIMRCVRTVSYSFLVNGKRVGCLTPSRGIRQGDPISPYLFLLCAEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFLFFKAKQSEARWVSKS

Query:  LSAYSILTGQEVNLAKS
           Y+  +   +N +KS
Subjt:  LSAYSILTGQEVNLAKS

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM1.9e-0728.43Show/hide
Query:  QSAFVPARNVCDNVILGFECLHYIRQRKRGGEGWAALKLDMSKAYDRVEWFFLEKLMLALGMDVKVVALIMRCVRTVSYSFLVNGKRVGCLTPSRGIRQG
        Q  F+P     DN  +    L +  +  R         LD+SKA+D +    +   + A G     V  +         S   +G       P+RG++QG
Subjt:  QSAFVPARNVCDNVILGFECLHYIRQRKRGGEGWAALKLDMSKAYDRVEWFFLEKLMLALGMDVKVVALIMRCVRTVSYSFLVNGKRVGCLTPSRGIRQG

Query:  DPISPYLFLLCAEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFLFFKAKQSEARWVSKSLSAYSILTGQEVNLAKSGVCFSPNVGVQAKQ
        DP+SP LF L    + R+L  L SE    G K+  G+   +   FADD  LF + +      + K+L   SI+ G ++N  K   CF+  +  Q KQ
Subjt:  DPISPYLFLLCAEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFLFFKAKQSEARWVSKSLSAYSILTGQEVNLAKSGVCFSPNVGVQAKQ

P92555 Uncharacterized mitochondrial protein AtMg012507.7e-1743.48Show/hide
Query:  FLVNGKRVGCLTPSRGIRQGDPISPYLFLLCAEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFLFFKAKQSEARWVSKSLSAYSI
        F++NG   G +TPSRG+RQGDP+SPYLF+LC E LS +    + +G++ G++++  SP I+HL FADD         S ARW+  +   + I
Subjt:  FLVNGKRVGCLTPSRGIRQGDPISPYLFLLCAEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFLFFKAKQSEARWVSKSLSAYSI

Q05118 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)1.9e-0730Show/hide
Query:  HYIRQRKRGGEGWAALKLDMSKAYDRVEWFFLEKLMLALGMDVKVVALIMRCVRTVSYSFLVNGKRVGCLTPSRGIRQGDPISPYLFLLCAEGLSRMLSW
        HYI+ R+  G+ +  + LD+ KA+D V    + + M A G+D  +   IM  +     + +V G+    +    G++QGDP+SP LF +    L  +++ 
Subjt:  HYIRQRKRGGEGWAALKLDMSKAYDRVEWFFLEKLMLALGMDVKVVALIMRCVRTVSYSFLVNGKRVGCLTPSRGIRQGDPISPYLFLLCAEGLSRMLSW

Query:  LESEGKVSGVKIARGSPSISHLFFADDCFL
        L  E   + +  A     I+ L FADD  L
Subjt:  LESEGKVSGVKIARGSPSISHLFFADDCFL

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases5.5e-1044.26Show/hide
Query:  QSAFVPARNVCDNVILGFECLHYIRQRKRGGEGWAALKLDMSKAYDRVEWFFLEKLMLALG
        Q++F+P R   DN++   E +H +R RK+G +GW  LKLD+ KAYDR+ W +LE  +++ G
Subjt:  QSAFVPARNVCDNVILGFECLHYIRQRKRGGEGWAALKLDMSKAYDRVEWFFLEKLMLALG

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)5.5e-1843.48Show/hide
Query:  FLVNGKRVGCLTPSRGIRQGDPISPYLFLLCAEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFLFFKAKQSEARWVSKSLSAYSI
        F++NG   G +TPSRG+RQGDP+SPYLF+LC E LS +    + +G++ G++++  SP I+HL FADD         S ARW+  +   + I
Subjt:  FLVNGKRVGCLTPSRGIRQGDPISPYLFLLCAEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFLFFKAKQSEARWVSKSLSAYSI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGAGTTGATCATGCGGTTGAAGAAGTGGGAGTTGACTGAGAGGGAGACACACGCGATTGAGGCTGTTGATAATAGTAGATTGTTTGCGGCTGAGGAGATTGACCT
ATGTCTTGCTGCAAAAGTTTTATCCAGCAAATACATTAATGCTGATGCCTTCAGAAACATGATGAAATCAGTGTGGAGGGTTCATCAGAATACGCGGATTGAGCCTGCAG
GTGACAACATTTTTCTCATAAAATTCAGATCAAAATTGGAGAAAGCGAGGGTTGTGAAATCAGGTCCTTGGAATTTTGATAAGTCATTGATTGTTATGGTTTCTCCTAGA
GAGAATGATCAGGTAGGGGAGTTAGATTTCACAAAGGAATCATTTTGGATTCAGGTCCATAATGTCCCACTAAATTGTATGACGGTTGATATGGCAAAGCTGTTGGGAGA
AGTAGTGGGTGTGATAGAGGAGATTGAGTGGGACGGGCAGAATGGTTGGACGGGGCCAATAGTGAGGATCCGTGTGGTTGTTGACATTCTGAAACCATTGCAGCGGGTTG
TTAAAATAAAAACTAAAGATGGGAAGGAAGTGTGGTGCCCTATCCACGTATGGTGTTCGGAGGACGAGAGGAGTGGGGGAGGGGGATTTGAGATCGAAGGAAGGGGTGGA
GATGGTGGGCGTGATCAGTTAGAAGGGGAGGTGGGTGTGGCGAGGGAAGGTGGAGATAAGGGGGGAGGGGGATCAGATGAGGTGATAGAACCAGGGAAAGGGGGGACGGA
GAATGAACCAGTAAGGTTGGGGAGTAGAGAGGGGGGAGTAGGGGATGCTGTGCAGGAGGGGACTGAGGTGGTGGATAAAGGGAAGGGGCTGTGGGTTGAAGGGGGAGGGG
TGGAGAGGGGAAATGAGGTGAGGAACGGAACCGACCTTATTATTAAAGATGTGAGTGGGGAGGTGGAGAGGAGGGTGGGAGAAGGGGCAGTGGGTGTTGGTATTAATGTT
TCCGAAGGGGGGTGGGAAGAAGTGGAAGAGGAGGGCGAGGGATGGGGCAGTGGCGATGGATTGCGGAGGGGAGAGCCAGTTGGGCAAGAGGTCTCTGAATATGGAGGGAA
AAAAGATGAAGAAAGGGAACGAAGTGAGCCGCTTAAGAAGAAAGTTGAGAGCGTTGGCAAAAGTGGGGGGTTCTGTTTGTTCTGGAAGGGGGGCCTGGATGTTGTGTTGC
GTTCGATGAGTAGGTTTCATATTGATGTGATGGTTAGGGCAGATGAGGGGCATTGGAGGTTTACGGGGTTTTATGGGGATCCGGAGCTGTCTAATAGGGGAGGCAGTGTT
CGAAATCAGCGCCAAATGCAGGCGTTTCGTGATGCTATAGATGATTGTGGCCTATCTGATATGGGATTTAGAGGGCATCCTTTTACATGGGAGATTGTTGAGAAGAGATG
GAATGACCCAGGCCATGTCCATTGTTTGAATCAGGGTAGCCTTATGGATGCTTGGAAGGTCAGGGTGAAAGGGTGTGCAGATGTGTTGTCGGGGTGGGGCAGAACGAGGA
AGGGTAATTATGCTAGTAAGATTTCCTTGGCCCGACAACAGTTGCAAGTGGCCATGTCCAATCAGGGGGATGTTCAAGAAACTCTAGATGGTTCCACCAAAGGGCATCCC
AGAGGCGAAGAGTTAATGGAATTCCTTGCCCACATTCAGCCGTGTGTGAGTGAGAGTGACAACGCACTGTTAAGGAAGCCTTTTTCAGAGGAGGAATTGTTGTGTTCTCT
AAAGCAGATGGGTCCAACAAAGGCCCCGGGTGAGGATGGGCTCCCAGCTCTCTTCTATCAGAAGTTTTGGGGCATCGTGGGGAAGGAGACCCAAAGTGCGTTTGTCCCAG
CAAGGAATGTATGTGATAATGTCATTCTAGGTTTTGAGTGTCTCCACTATATACGGCAGAGGAAGCGAGGGGGTGAAGGGTGGGCGGCTCTGAAGTTAGACATGAGCAAG
GCCTACGATAGGGTGGAGTGGTTTTTTCTAGAAAAGTTGATGCTCGCGCTGGGGATGGATGTGAAGGTGGTGGCTCTTATAATGAGGTGCGTGCGAACGGTTTCATATTC
GTTCTTGGTGAATGGTAAGAGAGTAGGTTGCTTAACGCCATCTAGAGGGATCCGTCAGGGGGATCCTATCTCACCCTATTTATTCTTATTATGTGCAGAAGGGTTGTCTA
GAATGTTATCCTGGTTGGAGAGTGAGGGTAAAGTGAGTGGTGTTAAAATTGCGAGGGGAAGCCCCTCGATATCCCATTTGTTTTTTGCAGACGACTGTTTTCTGTTTTTT
AAGGCCAAACAGAGTGAGGCCAGGTGGGTGTCTAAAAGTTTGAGTGCATATTCCATCCTCACAGGGCAAGAGGTAAACCTTGCGAAGTCAGGTGTGTGTTTTAGCCCGAA
CGTGGGGGTGCAGGCTAAACAGGCCATTGCCTCGTCTTTGGGAGTTCAGTTGGTGGGGTTCCATGATCGTTACTTGGGCCTGCCAGCTGTCTTTCTGGGGAGGAAAGCCC
TTTCTCTGAAGTATGTTAAGGATCAATTTGGTCTAATGTTCAGAAGTGGAAGCACACATGGTTTTCGGTTGGAGGGAAGGAGGTACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGAGTTGATCATGCGGTTGAAGAAGTGGGAGTTGACTGAGAGGGAGACACACGCGATTGAGGCTGTTGATAATAGTAGATTGTTTGCGGCTGAGGAGATTGACCT
ATGTCTTGCTGCAAAAGTTTTATCCAGCAAATACATTAATGCTGATGCCTTCAGAAACATGATGAAATCAGTGTGGAGGGTTCATCAGAATACGCGGATTGAGCCTGCAG
GTGACAACATTTTTCTCATAAAATTCAGATCAAAATTGGAGAAAGCGAGGGTTGTGAAATCAGGTCCTTGGAATTTTGATAAGTCATTGATTGTTATGGTTTCTCCTAGA
GAGAATGATCAGGTAGGGGAGTTAGATTTCACAAAGGAATCATTTTGGATTCAGGTCCATAATGTCCCACTAAATTGTATGACGGTTGATATGGCAAAGCTGTTGGGAGA
AGTAGTGGGTGTGATAGAGGAGATTGAGTGGGACGGGCAGAATGGTTGGACGGGGCCAATAGTGAGGATCCGTGTGGTTGTTGACATTCTGAAACCATTGCAGCGGGTTG
TTAAAATAAAAACTAAAGATGGGAAGGAAGTGTGGTGCCCTATCCACGTATGGTGTTCGGAGGACGAGAGGAGTGGGGGAGGGGGATTTGAGATCGAAGGAAGGGGTGGA
GATGGTGGGCGTGATCAGTTAGAAGGGGAGGTGGGTGTGGCGAGGGAAGGTGGAGATAAGGGGGGAGGGGGATCAGATGAGGTGATAGAACCAGGGAAAGGGGGGACGGA
GAATGAACCAGTAAGGTTGGGGAGTAGAGAGGGGGGAGTAGGGGATGCTGTGCAGGAGGGGACTGAGGTGGTGGATAAAGGGAAGGGGCTGTGGGTTGAAGGGGGAGGGG
TGGAGAGGGGAAATGAGGTGAGGAACGGAACCGACCTTATTATTAAAGATGTGAGTGGGGAGGTGGAGAGGAGGGTGGGAGAAGGGGCAGTGGGTGTTGGTATTAATGTT
TCCGAAGGGGGGTGGGAAGAAGTGGAAGAGGAGGGCGAGGGATGGGGCAGTGGCGATGGATTGCGGAGGGGAGAGCCAGTTGGGCAAGAGGTCTCTGAATATGGAGGGAA
AAAAGATGAAGAAAGGGAACGAAGTGAGCCGCTTAAGAAGAAAGTTGAGAGCGTTGGCAAAAGTGGGGGGTTCTGTTTGTTCTGGAAGGGGGGCCTGGATGTTGTGTTGC
GTTCGATGAGTAGGTTTCATATTGATGTGATGGTTAGGGCAGATGAGGGGCATTGGAGGTTTACGGGGTTTTATGGGGATCCGGAGCTGTCTAATAGGGGAGGCAGTGTT
CGAAATCAGCGCCAAATGCAGGCGTTTCGTGATGCTATAGATGATTGTGGCCTATCTGATATGGGATTTAGAGGGCATCCTTTTACATGGGAGATTGTTGAGAAGAGATG
GAATGACCCAGGCCATGTCCATTGTTTGAATCAGGGTAGCCTTATGGATGCTTGGAAGGTCAGGGTGAAAGGGTGTGCAGATGTGTTGTCGGGGTGGGGCAGAACGAGGA
AGGGTAATTATGCTAGTAAGATTTCCTTGGCCCGACAACAGTTGCAAGTGGCCATGTCCAATCAGGGGGATGTTCAAGAAACTCTAGATGGTTCCACCAAAGGGCATCCC
AGAGGCGAAGAGTTAATGGAATTCCTTGCCCACATTCAGCCGTGTGTGAGTGAGAGTGACAACGCACTGTTAAGGAAGCCTTTTTCAGAGGAGGAATTGTTGTGTTCTCT
AAAGCAGATGGGTCCAACAAAGGCCCCGGGTGAGGATGGGCTCCCAGCTCTCTTCTATCAGAAGTTTTGGGGCATCGTGGGGAAGGAGACCCAAAGTGCGTTTGTCCCAG
CAAGGAATGTATGTGATAATGTCATTCTAGGTTTTGAGTGTCTCCACTATATACGGCAGAGGAAGCGAGGGGGTGAAGGGTGGGCGGCTCTGAAGTTAGACATGAGCAAG
GCCTACGATAGGGTGGAGTGGTTTTTTCTAGAAAAGTTGATGCTCGCGCTGGGGATGGATGTGAAGGTGGTGGCTCTTATAATGAGGTGCGTGCGAACGGTTTCATATTC
GTTCTTGGTGAATGGTAAGAGAGTAGGTTGCTTAACGCCATCTAGAGGGATCCGTCAGGGGGATCCTATCTCACCCTATTTATTCTTATTATGTGCAGAAGGGTTGTCTA
GAATGTTATCCTGGTTGGAGAGTGAGGGTAAAGTGAGTGGTGTTAAAATTGCGAGGGGAAGCCCCTCGATATCCCATTTGTTTTTTGCAGACGACTGTTTTCTGTTTTTT
AAGGCCAAACAGAGTGAGGCCAGGTGGGTGTCTAAAAGTTTGAGTGCATATTCCATCCTCACAGGGCAAGAGGTAAACCTTGCGAAGTCAGGTGTGTGTTTTAGCCCGAA
CGTGGGGGTGCAGGCTAAACAGGCCATTGCCTCGTCTTTGGGAGTTCAGTTGGTGGGGTTCCATGATCGTTACTTGGGCCTGCCAGCTGTCTTTCTGGGGAGGAAAGCCC
TTTCTCTGAAGTATGTTAAGGATCAATTTGGTCTAATGTTCAGAAGTGGAAGCACACATGGTTTTCGGTTGGAGGGAAGGAGGTACTAA
Protein sequenceShow/hide protein sequence
MEELIMRLKKWELTERETHAIEAVDNSRLFAAEEIDLCLAAKVLSSKYINADAFRNMMKSVWRVHQNTRIEPAGDNIFLIKFRSKLEKARVVKSGPWNFDKSLIVMVSPR
ENDQVGELDFTKESFWIQVHNVPLNCMTVDMAKLLGEVVGVIEEIEWDGQNGWTGPIVRIRVVVDILKPLQRVVKIKTKDGKEVWCPIHVWCSEDERSGGGGFEIEGRGG
DGGRDQLEGEVGVAREGGDKGGGGSDEVIEPGKGGTENEPVRLGSREGGVGDAVQEGTEVVDKGKGLWVEGGGVERGNEVRNGTDLIIKDVSGEVERRVGEGAVGVGINV
SEGGWEEVEEEGEGWGSGDGLRRGEPVGQEVSEYGGKKDEERERSEPLKKKVESVGKSGGFCLFWKGGLDVVLRSMSRFHIDVMVRADEGHWRFTGFYGDPELSNRGGSV
RNQRQMQAFRDAIDDCGLSDMGFRGHPFTWEIVEKRWNDPGHVHCLNQGSLMDAWKVRVKGCADVLSGWGRTRKGNYASKISLARQQLQVAMSNQGDVQETLDGSTKGHP
RGEELMEFLAHIQPCVSESDNALLRKPFSEEELLCSLKQMGPTKAPGEDGLPALFYQKFWGIVGKETQSAFVPARNVCDNVILGFECLHYIRQRKRGGEGWAALKLDMSK
AYDRVEWFFLEKLMLALGMDVKVVALIMRCVRTVSYSFLVNGKRVGCLTPSRGIRQGDPISPYLFLLCAEGLSRMLSWLESEGKVSGVKIARGSPSISHLFFADDCFLFF
KAKQSEARWVSKSLSAYSILTGQEVNLAKSGVCFSPNVGVQAKQAIASSLGVQLVGFHDRYLGLPAVFLGRKALSLKYVKDQFGLMFRSGSTHGFRLEGRRY