; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022627 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022627
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:34617228..34620416
RNA-Seq ExpressionLag0022627
SyntenyLag0022627
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BAH93889.1 Os07g0417700 [Oryza sativa Japonica Group]2.9e-4330.04Show/hide
Query:  KLDMSKAYNRVEWVFLREIMAKIGFCEGWIAKVMMCVESVEFSVLINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKIC
        KLD+SKAYNRV+W FL + M K+GF   W++ +M C+ SV FSV  NG   D F P+RGL+QGDP+SP+L L  A+GLS LL  + S      + I +  
Subjt:  KLDMSKAYNRVEWVFLREIMAKIGFCEGWIAKVMMCVESVEFSVLINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKIC

Query:  HSISHLFFVDDSLVFCRAEENDCSTIKGILETYEKASDQDAEEI-LKIPLGSKNAMDEIIWAKHP----KGSFSVKTAYHLAVNQEEHRKASCSDNSKIQ
          ISHL F DD+L+F +++      IK  L TY  A+ Q        I  GS +        KH       SF  K   +L     E R       +  +
Subjt:  HSISHLFFVDDSLVFCRAEENDCSTIKGILETYEKASDQDAEEI-LKIPLGSKNAMDEIIWAKHP----KGSFSVKTAYHLAVNQEEHRKASCSDNSKIQ

Query:  AIWKNFWKIKAIPRAKICEQIGIKHAFD-MGVPAWDALLIWEGLVERLEE------------EEISIAILILWNIWTTRNKVINNGYKADQNQISK-IIE
         IWK          +   +++ IK     + V       + E + E L              EE  ++++I+W  W  RN+ + +G      ++SK  +E
Subjt:  AIWKNFWKIKAIPRAKICEQIGIKHAFD-MGVPAWDALLIWEGLVERLEE------------EEISIAILILWNIWTTRNKVINNGYKADQNQISK-IIE

Query:  ANISEHFKVRD-----------------TNLSASRSESHHSQYLWSPPPPPFLKLNSDASWSSSEEIGGLGWTVRDSSGSLVFAGNMKTKKNWETLELEA
        + +   F +R                   +   S+ +   S  LWS P   ++K+N D S+ + +E GG+G  +R+S G ++FA           LE E 
Subjt:  ANISEHFKVRD-----------------TNLSASRSESHHSQYLWSPPPPPFLKLNSDASWSSSEEIGGLGWTVRDSSGSLVFAGNMKTKKNWETLELEA

Query:  LAMKEGLSSFLQNSACKDRPLVVESDSSTLIKLIQNQESDLSEVRSIVEEIGLLARSAIGISFVWCSRSCNRATHTLAKDARLAGTIE-FPLIFVD
         A +EG+    Q + C   P  +ESD    +++IQ +E + SE+  ++ EI   A     I      RS N  +H LA   R     + F L+F D
Subjt:  LAMKEGLSSFLQNSACKDRPLVVESDSSTLIKLIQNQESDLSEVRSIVEEIGLLARSAIGISFVWCSRSCNRATHTLAKDARLAGTIE-FPLIFVD

GAU36089.1 hypothetical protein TSUD_277000 [Trifolium subterraneum]2.0e-4429.17Show/hide
Query:  MKLDMSKAYNRVEWVFLREIMAKIGFCEGWIAKVMMCVESVEFSVLINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKI
        +KLD+SKA++RV+W +L+ +M K+GF + W+  +M CV ++ + VL+N     P  P  GL+QGDP+SPYL+++C EGL+  ++  ES     G+RI + 
Subjt:  MKLDMSKAYNRVEWVFLREIMAKIGFCEGWIAKVMMCVESVEFSVLINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKI

Query:  CHSISHLFFVDDSLVFCRAEENDCSTIKGILETYEKASDQDAEEILKIPLGSKNA-------MDEIIWAKHPKGSFSVKTAYHLAVNQEEHRKASCSDNS
          SISHL F DDS +FC+A   + + +K +L+TYE+AS Q           S+N        ++ I+      G        +L +     R      + 
Subjt:  CHSISHLFFVDDSLVFCRAEENDCSTIKGILETYEKASDQDAEEILKIPLGSKNA-------MDEIIWAKHPKGSFSVKTAYHLAVNQEEHRKASCSDNS

Query:  KIQAIWKNF--WKIKAIPRA---KICEQIGIKHAFDMGVPAWDALLIWEGLVERLEEEEISIAILILWNIWTTRNKVINNGYKADQNQISKIIEANISEH
            IWK    W  +++ RA   ++     ++H F +   ++ +++    ++  +EE + S+   +LW+IW  RN+ I    +A+    S  +  ++   
Subjt:  KIQAIWKNF--WKIKAIPRA---KICEQIGIKHAFDMGVPAWDALLIWEGLVERLEEEEISIAILILWNIWTTRNKVINNGYKADQNQISKIIEANISEH

Query:  FKVRDTNLSASRSESHHSQYLWSPPPPPFLKLNSDASWSSSEEIGGLGWTVRDSSGSLVFAGNMKTKKNWETLELEALAMKEGLSSFLQNSACKDRPLVV
        F      L+A    +H   + W  PP  +LK N D +   +E   G+G   RDSSGS V A  M        +E EA AMK  L+  L N       ++ 
Subjt:  FKVRDTNLSASRSESHHSQYLWSPPPPPFLKLNSDASWSSSEEIGGLGWTVRDSSGSLVFAGNMKTKKNWETLELEALAMKEGLSSFLQNSACKDRPLVV

Query:  ESDSSTLIKLIQNQESDLSEVRSIVEEIGLLARSAIGISFVWCSRSCNRATHTLAK
        ESD   ++  ++N     +E+ +++     L  S    +  +  R  NR  H LA+
Subjt:  ESDSSTLIKLIQNQESDLSEVRSIVEEIGLLARSAIGISFVWCSRSCNRATHTLAK

XP_022157437.1 uncharacterized protein LOC111024135 [Momordica charantia]2.2e-4355.9Show/hide
Query:  MKLDMSKAYNRVEWVFLREIMAKIGFCEGWIAKVMMCVESVEFSVLINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKI
        MKLDMSKAY+RVEW +LR IM K+GF   W+  +M CVESV F+VLINGVP D F PNRGL+QGDP+SPYLF++CAEGLS L+N EE       ++IN+ 
Subjt:  MKLDMSKAYNRVEWVFLREIMAKIGFCEGWIAKVMMCVESVEFSVLINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKI

Query:  CHSISHLFFVDDSLVFCRAEENDCSTIKGILETYEKASDQDAEEILK-IPLGSKNAMDEII
        C  ISHLF+ DD L+F +A   +C +IKGILE+YEKAS      + K + + SKN  ++++
Subjt:  CHSISHLFFVDDSLVFCRAEENDCSTIKGILETYEKASDQDAEEILK-IPLGSKNAMDEII

XP_030477911.1 uncharacterized protein LOC115694948 [Cannabis sativa]9.6e-4724.88Show/hide
Query:  MKLDMSKAYNRVEWVFLREIMAKIGFCEGWIAKVMMCVESVEFSVLINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKI
        +KLDMSKA++RVEW +L  +M K+GF   W+  +M C+ +  FS  +NG       P+RGL+QGDP+SPYLFL+C+EGLS +L  EES    RG+ I + 
Subjt:  MKLDMSKAYNRVEWVFLREIMAKIGFCEGWIAKVMMCVESVEFSVLINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKI

Query:  CHSISHLFFVDDSLVFCRAEENDCSTIKGILETYEKASDQ-----------------------------------------------DAEEILK------
          SISHL F DDSL+FC+A +     IK +L TY +AS Q                                               D +E+        
Subjt:  CHSISHLFFVDDSLVFCRAEENDCSTIKGILETYEKASDQ-----------------------------------------------DAEEILK------

Query:  -----------IPLGSK------------------------------------------NAMDEIIWAKHPKGSFSVKTAYHLAVNQEEHRKASCSDNSK
                     +G K                                          NA D +IW     G ++VK+ +HLA   E+   +S SD ++
Subjt:  -----------IPLGSK------------------------------------------NAMDEIIWAKHPKGSFSVKTAYHLAVNQEEHRKASCSDNSK

Query:  IQAIWKNFWKIKAIPRAKIC-----------------EQIGIKHAFDMGVPAWDAL-----------LIWEGLVERLE---------------------E
            WK FW +K  P+ +I                   +I    A  +   +W+++            +W+    R++                     +
Subjt:  IQAIWKNFWKIKAIPRAKIC-----------------EQIGIKHAFDMGVPAWDAL-----------LIWEGLVERLE---------------------E

Query:  EEISIAILILWNIWTTRNKVINNGYKADQNQISKIIEANISEHFKVRDTNLSASRSESHHS-----------QYLWSPPPPPFLKLNSDASWSSSEEIGG
        E+  + + +LW +WT RN+V + G   D + I        ++  K +      + +  +HS           Q  WSPP     KLN DA+ +  ++  G
Subjt:  EEISIAILILWNIWTTRNKVINNGYKADQNQISKIIEANISEHFKVRDTNLSASRSESHHS-----------QYLWSPPPPPFLKLNSDASWSSSEEIGG

Query:  LGWTVRDSSGSLVFAGNMKTKKNWETLELEALAMKEGLSSFLQNSACKDRPLVVESDSSTLIKLIQNQESDLSEVRSIVEEIGLLARSAIGISFVWCSRS
        +G  +RD  G +V A +   + ++ + E+EA ++   L+   Q+         +E+D+  +   + +  +DLS   S++ ++  L     G+      RS
Subjt:  LGWTVRDSSGSLVFAGNMKTKKNWETLELEALAMKEGLSSFLQNSACKDRPLVVESDSSTLIKLIQNQESDLSEVRSIVEEIGLLARSAIGISFVWCSRS

Query:  CNRATHTLAK-------DARLAGTIEFPLIFVDVS
         N+A H LAK       D    G I +P+  + V+
Subjt:  CNRATHTLAK-------DARLAGTIEFPLIFVDVS

XP_030502823.1 uncharacterized protein LOC115717993 [Cannabis sativa]1.1e-4225.77Show/hide
Query:  MKLDMSKAYNRVEWVFLREIMAKIGFCEGWIAKVMMCVESVEFSVLINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKI
        +KLDMSKA +RVEW F+ E+M K+GF   WI  +M C+ +  FS L+NG P    +P+RGL+QG P+SPYLFL+C+EGLS LL  EE+S   +G ++ + 
Subjt:  MKLDMSKAYNRVEWVFLREIMAKIGFCEGWIAKVMMCVESVEFSVLINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKI

Query:  CHSISHLFFVDDSLVFCRAEENDCSTIKGILETYEKASDQ------------------------------------------------------------
          SISHLFF DDSL+FC+A E+ C  +K  LE Y++AS Q                                                            
Subjt:  CHSISHLFFVDDSLVFCRAEENDCSTIKGILETYEKASDQ------------------------------------------------------------

Query:  -------------------------------------------------------------------------------DAEEILKIPLGSKNAMDEIIW
                                                                                       D + IL IPL      D +IW
Subjt:  -------------------------------------------------------------------------------DAEEILKIPLGSKNAMDEIIW

Query:  AKHPKGSFSVKTAYHLAVNQEEHRKASCSDNSKIQAIWKNFWKIK---------------AIPRA--------------KICEQI--GIKHAF---DMGV
          +  G ++V+T YH A + E+   +S S  S   A WK  W +K               A+P A               IC Q    I HA        
Subjt:  AKHPKGSFSVKTAYHLAVNQEEHRKASCSDNSKIQAIWKNFWKIK---------------AIPRA--------------KICEQI--GIKHAF---DMGV

Query:  PAW------------------DALLIWEGLVERLEEEEISIAILILWNIWTTRNKVINNGYKADQNQISKIIEANISEHFKV-RDTNLSASRSESHHSQY
          W                  D L+    +  ++E E+I   + I+W+IWT RN+V+ +G +A   ++      N  ++F + +  +LS    ++  S  
Subjt:  PAW------------------DALLIWEGLVERLEEEEISIAILILWNIWTTRNKVINNGYKADQNQISKIIEANISEHFKV-RDTNLSASRSESHHSQY

Query:  L--------------WSPPPPPFLKLNSDASWSSSEEIGGLGWTVRDSSGSLVFAGNMKTKKNWETLELEALAMKEGLSSFLQNSACKDRPLVVESDSST
                       W PP    LKLN DA+  S+++I G+G  VRDS+G +  A +      +++ E+EA+A+   L+  LQ      +   +E+D+  
Subjt:  L--------------WSPPPPPFLKLNSDASWSSSEEIGGLGWTVRDSSGSLVFAGNMKTKKNWETLELEALAMKEGLSSFLQNSACKDRPLVVESDSST

Query:  LIKLIQNQESDLSEVRSIVEEIGLLARSAIGISFVWCSRSCNRATHTLAKDA
        +   +    S +S  + ++ +I  L      +      RS N A   LAK A
Subjt:  LIKLIQNQESDLSEVRSIVEEIGLLARSAIGISFVWCSRSCNRATHTLAKDA

TrEMBL top hitse value%identityAlignment
A0A2N9FR17 Reverse transcriptase domain-containing protein7.9e-4726.44Show/hide
Query:  MKLDMSKAYNRVEWVFLREIMAKIGFCEGWIAKVMMCVESVEFSVLINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKI
        +KLDM+KAY+RVEW+FL +IM+++GF + WI+ +  C+ +V +S+L+NG P     P+RGL+QGDP+SPYLFLLCAEGL  L+N+       +G+ + + 
Subjt:  MKLDMSKAYNRVEWVFLREIMAKIGFCEGWIAKVMMCVESVEFSVLINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKI

Query:  CHSISHLFFVDDSLVFCRAEENDCSTIKGILETYEKASDQ------------------------------------------------------------
           I+HLFF DD+L+F +A    C  I+GIL+ YEKAS Q                                                            
Subjt:  CHSISHLFFVDDSLVFCRAEENDCSTIKGILETYEKASDQ------------------------------------------------------------

Query:  -------------------DAEEILKIPLGSKNAMDEIIWAKHPKGSFSVKTAYHLAVNQEEHRKASCSDNSKIQAIWKNFWKIKAIPRAKICEQIGIK-
                           D E I +IPL +++  D++IW  +  G +SV++ Y   V++E+      S  ++     K   + + IP    CE  G K 
Subjt:  -------------------DAEEILKIPLGSKNAMDEIIWAKHPKGSFSVKTAYHLAVNQEEHRKASCSDNSKIQAIWKNFWKIKAIPRAKICEQIGIK-

Query:  ----HAF---DMGVPAWDALLIW------EGLVERLE----------EEEISIAILILWNIWTTRNKVINNGYKADQNQISKIIEANISEHFKVRDTNLS
            HA          WDA  +W        +V+  +          + E    I+I W +W  RNK+         NQ+   ++  + E+   R+T  +
Subjt:  ----HAF---DMGVPAWDALLIW------EGLVERLE----------EEEISIAILILWNIWTTRNKVINNGYKADQNQISKIIEANISEHFKVRDTNLS

Query:  ASRSESHHSQYLWSPPPPPFLKLNSDASWSSSEEIGGLGWTVRDSSGSLVFAGNMKTKKNWETLELEALAMKEGLSSFLQNSACKDRPLVVESDSSTLIK
          + +   +   W PP     K+N D +        G+   VRDS G ++ +   K +       +EA A+K  +   L+    +      E DS  ++ 
Subjt:  ASRSESHHSQYLWSPPPPPFLKLNSDASWSSSEEIGGLGWTVRDSSGSLVFAGNMKTKKNWETLELEALAMKEGLSSFLQNSACKDRPLVVESDSSTLIK

Query:  LIQNQESDLSEVRSIVEEIGLLARSAIGISFVWCSRSCNRATHTLAKDARLAGTIE
         + N    L+    ++ +  +LA      SF    R  N+  H LA+ A    ++E
Subjt:  LIQNQESDLSEVRSIVEEIGLLARSAIGISFVWCSRSCNRATHTLAKDARLAGTIE

A0A803NML1 Uncharacterized protein9.0e-5127.95Show/hide
Query:  MKLDMSKAYNRVEWVFLREIMAKIGFCEGWIAKVMMCVESVEFSVLINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKI
        +KLDMSKA++RVEW +L  IM K+GF   WI+ +M C+ +  FS  +NG       P+RGL+QGDP+SPYLFL+C+EGLS  L  +E S   +G+ + + 
Subjt:  MKLDMSKAYNRVEWVFLREIMAKIGFCEGWIAKVMMCVESVEFSVLINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKI

Query:  CHSISHLFFVDDSLVFCRAEENDCSTIKGILETYEKASDQ------------------------------------------------------------
          S+SHL F DDSL+FC++ E    +IK  L+TY +AS Q                                                            
Subjt:  CHSISHLFFVDDSLVFCRAEENDCSTIKGILETYEKASDQ------------------------------------------------------------

Query:  ------DAEEILKIPLGSKNAMDEIIWAKHPKGSFSVKTAYHLAVNQEEHRKASCSDNSKIQAIWKNFWKIKAIPRAKI---------------------
              D + IL IPL      D ++W   P G +SVKT +HLA   E+   +S S  +K    WK FW +K  P+ +I                     
Subjt:  ------DAEEILKIPLGSKNAMDEIIWAKHPKGSFSVKTAYHLAVNQEEHRKASCSDNSKIQAIWKNFWKIKAIPRAKI---------------------

Query:  ------------CEQIG-----IKHAFDM------GVPAWDALLIWEG-----LVERLEEEEISIAILILWNIWTTRNKVINNGYKADQNQISKIIEANI
                     E IG      KHA D+       +    A  ++ G     L    ++ +    + +LW IWT RNKV++ G       I +      
Subjt:  ------------CEQIG-----IKHAFDM------GVPAWDALLIWEG-----LVERLEEEEISIAILILWNIWTTRNKVINNGYKADQNQISKIIEANI

Query:  SEHFKVRDTNLSASRSESHHSQ----------YLWSPPPPPFLKLNSDASWSSSEEIGGLGWTVRDSSGSLVFAGNMKTKKNWETLELEALAMKEGLSSF
         +  K +    S + +  HHS             W PP     KLN DA+ +  ++  G+G  +RD  G+++ A +   + ++++ E+EA A+   ++  
Subjt:  SEHFKVRDTNLSASRSESHHSQ----------YLWSPPPPPFLKLNSDASWSSSEEIGGLGWTVRDSSGSLVFAGNMKTKKNWETLELEALAMKEGLSSF

Query:  LQNSACKDRPLV-VESDSSTLIKLIQNQESDLSEVRSIVEEIGLLARSAIGISFVWCSRSCNRATHTLAK-------DARLAGTIEFPLIFVDV
         Q+      PL  +E+D+S +   +    SDLS    ++ +I  L  S   +      R+ N+  H LAK       D    G I +P +F DV
Subjt:  LQNSACKDRPLV-VESDSSTLIKLIQNQESDLSEVRSIVEEIGLLARSAIGISFVWCSRSCNRATHTLAK-------DARLAGTIEFPLIFVDV

A0A803PM52 Uncharacterized protein9.3e-4827.5Show/hide
Query:  MKLDMSKAYNRVEWVFLREIMAKIGFCEGWIAKVMMCVESVEFSVLINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKI
        +KLDMSKA +RVEW F+ E+M K+GF   WI  +M C+ +  FS L+NG P    +P+RGL+QG P+SPYLFL+C+EGLS LL  EE+S   +G ++ + 
Subjt:  MKLDMSKAYNRVEWVFLREIMAKIGFCEGWIAKVMMCVESVEFSVLINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKI

Query:  CHSISHLFFVDDSLVFCRAEENDCSTIKGILETYEKASDQ------------------------------------------------------------
          SISHLFF DDSL+FC+A E+ C  +K  LE Y++AS Q                                                            
Subjt:  CHSISHLFFVDDSLVFCRAEENDCSTIKGILETYEKASDQ------------------------------------------------------------

Query:  --------------------------------------DAEEILKIPLGSKNAMDEIIWAKHPKGSFSVKTAYHLAVNQEEHRKASCSDNSKIQAIWKNF
                                              D + IL IPL      D +IW  +  G ++V+T YH A + E+   +S S  S   A WK  
Subjt:  --------------------------------------DAEEILKIPLGSKNAMDEIIWAKHPKGSFSVKTAYHLAVNQEEHRKASCSDNSKIQAIWKNF

Query:  WKIK---------------AIPRA--------------KICEQI--GIKHAF---DMGVPAW------------------DALLIWEGLVERLEEEEISI
        W +K               A+P A               IC Q    I HA          W                  D L+    +  ++E E+I  
Subjt:  WKIK---------------AIPRA--------------KICEQI--GIKHAF---DMGVPAW------------------DALLIWEGLVERLEEEEISI

Query:  AILILWNIWTTRNKVINNGYKADQNQISKIIEANISEHFKV-RDTNLSASRSESHHSQYL--------------WSPPPPPFLKLNSDASWSSSEEIGGL
         + I+W+IWT RN+V+ +G +A   ++      N  ++F + +  +LS    ++  S                 W PP    LKLN DA+  S+++I G+
Subjt:  AILILWNIWTTRNKVINNGYKADQNQISKIIEANISEHFKV-RDTNLSASRSESHHSQYL--------------WSPPPPPFLKLNSDASWSSSEEIGGL

Query:  GWTVRDSSGSLVFAGNMKTKKNWETLELEALAMKEGLSSFLQNSACKDRPLVVESDSSTLIKLIQNQESDLSEVRSIVEEIGLLARSAIGISFVWCSRSC
        G  VRDS+G +  A +      +++ E+EA+A+   L+  LQ      +   +E+D+  +   +    S +S  + ++ +I  L      +      RS 
Subjt:  GWTVRDSSGSLVFAGNMKTKKNWETLELEALAMKEGLSSFLQNSACKDRPLVVESDSSTLIKLIQNQESDLSEVRSIVEEIGLLARSAIGISFVWCSRSC

Query:  NRATHTLAKDA
        N A   LAK A
Subjt:  NRATHTLAKDA

A0A803QB90 Uncharacterized protein4.3e-4524.11Show/hide
Query:  MKLDMSKAYNRVEWVFLREIMAKIGFCEGWIAKVMMCVESVEFSVLINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKI
        +KLDMSKA++RVEW +L  +M K+GF   W+  +M C+ +  FS  +NG       P+RGL+QGDP+SPYLFL+C+EGLS +L  EES    RG+ I + 
Subjt:  MKLDMSKAYNRVEWVFLREIMAKIGFCEGWIAKVMMCVESVEFSVLINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKI

Query:  CHSISHLFFVDDSLVFCRAEENDCSTIKGILETYEKASDQ-----------------------------------------------DAEEILK------
          SISHL F DDSL+FC+A +     IK +L TY +AS Q                                               D +E+        
Subjt:  CHSISHLFFVDDSLVFCRAEENDCSTIKGILETYEKASDQ-----------------------------------------------DAEEILK------

Query:  -----------IPLGSKNAM------------------------------------------------------DEIIWAKHPKGSFSVKTAYHLAVNQE
                     +G K  +                                                      D +IW     G ++VK+ +HLA   E
Subjt:  -----------IPLGSKNAM------------------------------------------------------DEIIWAKHPKGSFSVKTAYHLAVNQE

Query:  EHRKASCSDNSKIQAIWKNFWKIKAIPRAKIC-----------------EQIGIKHAFDMGVPAWDAL-----------LIWEGLVERLE----------
        +   +S SD ++    WK FW +K  P+ +I                   +I    A  +   +W+++            +W+    R++          
Subjt:  EHRKASCSDNSKIQAIWKNFWKIKAIPRAKIC-----------------EQIGIKHAFDMGVPAWDAL-----------LIWEGLVERLE----------

Query:  -----------EEEISIAILILWNIWTTRNKVINNGYKADQNQISKIIEANISEHFKVRDTNLSASRSESHHS-----------QYLWSPPPPPFLKLNS
                   +E+  + + +LW +WT RN+V + G   D + I        ++  K +      + +  +HS           Q  WSPP     KLN 
Subjt:  -----------EEEISIAILILWNIWTTRNKVINNGYKADQNQISKIIEANISEHFKVRDTNLSASRSESHHS-----------QYLWSPPPPPFLKLNS

Query:  DASWSSSEEIGGLGWTVRDSSGSLVFAGNMKTKKNWETLELEALAMKEGLSSFLQNSACKDRPLVVESDSSTLIKLIQNQESDLSEVRSIVEEIGLLARS
        DA+ +  ++  G+G  +RD  G +V A +   + ++ + E+EA ++   L+   Q+         +E+D+  +   + +  +DLS   S++ ++  L   
Subjt:  DASWSSSEEIGGLGWTVRDSSGSLVFAGNMKTKKNWETLELEALAMKEGLSSFLQNSACKDRPLVVESDSSTLIKLIQNQESDLSEVRSIVEEIGLLARS

Query:  AIGISFVWCSRSCNRATHTLAK-------DARLAGTIEFPLIFVDVS
          G+      RS N+A H LAK       D    G I +P+  + V+
Subjt:  AIGISFVWCSRSCNRATHTLAK-------DARLAGTIEFPLIFVDVS

A0A803QHW9 Uncharacterized protein2.3e-4629.13Show/hide
Query:  MKLDMSKAYNRVEWVFLREIMAKIGFCEGWIAKVMMCVESVEFSVLINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKI
        +KLDMSKA++RVEW FL  +M K+GF   WIA +  C+ S   S  +NG       P+RGL+QGDP+SPY FL+ +EGLS LL  EE+    +G+R+ + 
Subjt:  MKLDMSKAYNRVEWVFLREIMAKIGFCEGWIAKVMMCVESVEFSVLINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKI

Query:  CHSISHLFFVDDSLVFCRAEENDCSTIKGILETYEKASDQDAEEILKIPLGSKNAMDEIIWAKHPKGSFSVKTAYHLAVNQEEHRKASCSD-NSKIQAIW
          S+SHL F DDSL+FC    N    I+  L  Y +ASD+                  +IW       ++VK+ +HLA   +E ++AS SD NSK    W
Subjt:  CHSISHLFFVDDSLVFCRAEENDCSTIKGILETYEKASDQDAEEILKIPLGSKNAMDEIIWAKHPKGSFSVKTAYHLAVNQEEHRKASCSD-NSKIQAIW

Query:  KNFWKIK---------------------AIPRAKICEQI----------GIKHAF-------------DMGVPAWDALLIWEG-----LVERLEEEEISI
        K+FW +K                     A+ R K+ + +           I HA              D  +    A  + +G     L     + +  +
Subjt:  KNFWKIK---------------------AIPRAKICEQI----------GIKHAF-------------DMGVPAWDALLIWEG-----LVERLEEEEISI

Query:  AILILWNIWTTRNKVINNGYKAD--------QNQISKIIEANISEHFKVRDTNLSASRSESHHSQ----YLWSPPPPPFLKLNSDASWSSSEEIGGLGWT
         +  +W IW+ RNKV++ G   +        Q+ + K   +    H     T++S + + +   Q      W PP    LKLN DA+ +++ +  G+G  
Subjt:  AILILWNIWTTRNKVINNGYKAD--------QNQISKIIEANISEHFKVRDTNLSASRSESHHSQ----YLWSPPPPPFLKLNSDASWSSSEEIGGLGWT

Query:  VRDSSGSLVFAGNMKTKKNWETLELEALAMKEGLSSFLQNSACKDRPLVVESDSSTLIKLIQNQESDLSEVRSIVEEIGLLARSAIGISFVWCSRSCNRA
        VRD +G +V A +   +  + + E+EA A+   L+  LQ      +   +E+DS  +   + +   DLS    ++ ++  L       +     R+ N  
Subjt:  VRDSSGSLVFAGNMKTKKNWETLELEALAMKEGLSSFLQNSACKDRPLVVESDSSTLIKLIQNQESDLSEVRSIVEEIGLLARSAIGISFVWCSRSCNRA

Query:  THTLAKDA
         H LAK A
Subjt:  THTLAKDA

SwissProt top hitse value%identityAlignment
P11369 LINE-1 retrotransposable element ORF2 protein1.1e-0522.39Show/hide
Query:  MKLDMSKAYNRVEWVFLREIMAKIGFCEGWIAKVMMCVESVEFSVLINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKI
        + LD  KA+++++  F+ +++ + G    ++  +         ++ +NG   +      G +QG P+SPYLF +  E L+  + +++   + +GI+I K 
Subjt:  MKLDMSKAYNRVEWVFLREIMAKIGFCEGWIAKVMMCVESVEFSVLINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKI

Query:  CHSISHLFFVDDSLVFCRAEENDCSTIKGILETY
           IS L   DD +V+    +N    +  ++ ++
Subjt:  CHSISHLFFVDDSLVFCRAEENDCSTIKGILETY

P92555 Uncharacterized mitochondrial protein AtMg012501.9e-1351.47Show/hide
Query:  LINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKICHSISHLFFVDDS
        +ING PQ    P+RGL+QGDP+SPYLF+LC E LSGL  R +   +  GIR++     I+HL F DD+
Subjt:  LINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKICHSISHLFFVDDS

Q05118 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)7.2e-0527.83Show/hide
Query:  MKLDMSKAYNRVEWVFLREIMAKIGFCEGWIAKVMMCVESVEFSVLINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKI
        + LD+ KA++ V    +   M   G  +G    +M  +     ++++ G   +      G+KQGDP+SP LF +  + L   LN E+      G  +   
Subjt:  MKLDMSKAYNRVEWVFLREIMAKIGFCEGWIAKVMMCVESVEFSVLINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKI

Query:  CHSISHLFFVDDSLV
        C  I+ L F DD L+
Subjt:  CHSISHLFFVDDSLV

Q8GWK2 AP2-like ethylene-responsive transcription factor At2g417101.0e-2279.01Show/hide
Query:  MAS-SSSDPGFKHEAGACSAAAAGGGTAESSEVVMANDQLLLYRGLKKAKKERGCTAKERISKMPPCAAGKRSSIYRGVTR
        MAS SSSD G K EAG CS    GGG  ESSE V A+DQ+LLYRG KKAKKERGCTAKERISKMPPC AGKRSSIYRGVTR
Subjt:  MAS-SSSDPGFKHEAGACSAAAAGGGTAESSEVVMANDQLLLYRGLKKAKKERGCTAKERISKMPPCAAGKRSSIYRGVTR

X5HYT8 AP2-like ethylene-responsive transcription factor SMOS11.1e-0848.42Show/hide
Query:  MASSSSDPGFKHEAGACSAAAAGGGTAE-------------SSEVVMANDQLLL--YRGLKKAKKERGCTAKERISKMPPCAAGKRSSIYRGVTR
        MAS     G + +  A +AAA GG  AE               E V A  + +    R    A+KER CTAKERIS+MPPCAAGKRSSIYRGVTR
Subjt:  MASSSSDPGFKHEAGACSAAAAGGGTAE-------------SSEVVMANDQLLL--YRGLKKAKKERGCTAKERISKMPPCAAGKRSSIYRGVTR

Arabidopsis top hitse value%identityAlignment
AT2G41710.1 Integrase-type DNA-binding superfamily protein7.1e-2479.01Show/hide
Query:  MAS-SSSDPGFKHEAGACSAAAAGGGTAESSEVVMANDQLLLYRGLKKAKKERGCTAKERISKMPPCAAGKRSSIYRGVTR
        MAS SSSD G K EAG CS    GGG  ESSE V A+DQ+LLYRG KKAKKERGCTAKERISKMPPC AGKRSSIYRGVTR
Subjt:  MAS-SSSDPGFKHEAGACSAAAAGGGTAESSEVVMANDQLLLYRGLKKAKKERGCTAKERISKMPPCAAGKRSSIYRGVTR

AT2G41710.2 Integrase-type DNA-binding superfamily protein7.1e-2479.01Show/hide
Query:  MAS-SSSDPGFKHEAGACSAAAAGGGTAESSEVVMANDQLLLYRGLKKAKKERGCTAKERISKMPPCAAGKRSSIYRGVTR
        MAS SSSD G K EAG CS    GGG  ESSE V A+DQ+LLYRG KKAKKERGCTAKERISKMPPC AGKRSSIYRGVTR
Subjt:  MAS-SSSDPGFKHEAGACSAAAAGGGTAESSEVVMANDQLLLYRGLKKAKKERGCTAKERISKMPPCAAGKRSSIYRGVTR

AT2G41710.3 Integrase-type DNA-binding superfamily protein7.1e-2479.01Show/hide
Query:  MAS-SSSDPGFKHEAGACSAAAAGGGTAESSEVVMANDQLLLYRGLKKAKKERGCTAKERISKMPPCAAGKRSSIYRGVTR
        MAS SSSD G K EAG CS    GGG  ESSE V A+DQ+LLYRG KKAKKERGCTAKERISKMPPC AGKRSSIYRGVTR
Subjt:  MAS-SSSDPGFKHEAGACSAAAAGGGTAESSEVVMANDQLLLYRGLKKAKKERGCTAKERISKMPPCAAGKRSSIYRGVTR

AT4G29090.1 Ribonuclease H-like superfamily protein1.1e-1324.42Show/hide
Query:  ILWNIWTTRNKVINNGYKADQNQISKIIEANISE-HFKVRDTNLSASRSESHHSQYLWSPPPPPFLKLNSDASWSSSEEIGGLGWTVRDSSGSLVFAGNM
        +LW +W  RN+++  G + +  ++ +  E ++ E   +    +       +  S   W PPP  ++K N+DA+W+   E  G+GW +R+  G + + G  
Subjt:  ILWNIWTTRNKVINNGYKADQNQISKIIEANISE-HFKVRDTNLSASRSESHHSQYLWSPPPPPFLKLNSDASWSSSEEIGGLGWTVRDSSGSLVFAGNM

Query:  KTKKNWETLELEALAMKEGLSSFLQNSACKDRPLVVESDSSTLIKLIQNQESDLSEVRSIVEEIGLLARSAIGISFVWCSRSCNRATHTLAKDARLAGTI
           K    LE E  AM+  + S    S  +   ++ ESDS  LI+++ N E     ++  ++++  L      + FV+  R  N     +A+++      
Subjt:  KTKKNWETLELEALAMKEGLSSFLQNSACKDRPLVVESDSSTLIKLIQNQESDLSEVRSIVEEIGLLARSAIGISFVWCSRSCNRATHTLAKDARLAGTI

Query:  EFPLIFVDVSMASSSSD
        +  L  +  S A SS D
Subjt:  EFPLIFVDVSMASSSSD

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.3e-1451.47Show/hide
Query:  LINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKICHSISHLFFVDDS
        +ING PQ    P+RGL+QGDP+SPYLF+LC E LSGL  R +   +  GIR++     I+HL F DD+
Subjt:  LINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKICHSISHLFFVDDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCTTGACATGAGCAAAGCGTACAACAGGGTTGAATGGGTGTTCCTCAGGGAGATTATGGCCAAGATTGGTTTTTGTGAGGGATGGATTGCTAAGGTTATGATGTG
CGTGGAATCAGTGGAATTCTCAGTCTTGATCAATGGAGTTCCTCAAGATCCGTTCATGCCGAACAGAGGGCTCAAACAAGGAGACCCCATCTCCCCCTATCTGTTCCTCC
TTTGCGCTGAAGGGCTTTCGGGGCTCCTTAACAGGGAAGAATCCTCTCTCAAATTTAGAGGCATTAGAATAAACAAAATTTGCCATTCCATTTCTCATTTATTCTTTGTT
GACGATAGCTTGGTTTTTTGTAGGGCTGAGGAGAATGATTGCAGTACTATAAAGGGTATTCTAGAGACGTACGAGAAGGCATCAGATCAGGATGCCGAGGAGATTCTAAA
GATCCCGCTGGGAAGCAAAAATGCTATGGACGAAATCATATGGGCTAAGCACCCTAAAGGGTCTTTCTCTGTGAAAACAGCCTACCATCTGGCGGTTAACCAAGAAGAGC
ATCGAAAAGCTAGCTGCTCGGATAATTCCAAGATCCAAGCTATATGGAAGAACTTTTGGAAGATAAAAGCAATCCCCAGAGCCAAAATCTGTGAGCAAATAGGAATCAAA
CACGCATTTGATATGGGAGTGCCTGCGTGGGATGCTCTATTGATTTGGGAAGGTCTAGTGGAAAGACTCGAGGAGGAAGAGATAAGCATCGCCATCCTAATTCTATGGAA
TATTTGGACAACTCGGAACAAGGTGATCAACAATGGCTACAAAGCAGATCAAAATCAAATCAGTAAAATTATTGAAGCCAACATATCCGAGCATTTCAAGGTTAGAGATA
CTAACCTGAGTGCTTCTAGATCGGAGAGTCATCATAGTCAATATTTGTGGTCTCCTCCTCCCCCACCTTTTCTGAAGCTGAATTCCGATGCCTCTTGGAGCAGTTCAGAA
GAAATTGGGGGCTTAGGTTGGACGGTTCGTGACTCCTCAGGGTCTCTGGTATTCGCCGGCAACATGAAGACAAAGAAGAATTGGGAAACGTTGGAGTTGGAAGCTCTAGC
TATGAAGGAAGGGTTGTCTTCTTTCTTGCAAAATAGCGCATGCAAGGATCGCCCTCTTGTGGTTGAATCGGACTCGAGCACTCTGATTAAGTTGATCCAAAATCAAGAGT
CTGATCTCTCTGAGGTAAGAAGCATTGTGGAAGAGATTGGGCTTCTGGCAAGGAGCGCAATAGGAATCTCCTTCGTTTGGTGTTCGAGAAGCTGCAATCGAGCAACCCAC
ACCCTTGCTAAGGATGCGAGATTGGCAGGAACTATAGAGTTTCCTTTGATTTTTGTTGATGTTTCCATGGCTTCTTCGTCCTCTGATCCTGGCTTCAAACACGAAGCCGG
TGCCTGTAGCGCTGCTGCTGCTGGAGGAGGAACGGCGGAGTCTTCCGAGGTTGTAATGGCGAATGATCAGCTTTTACTGTACAGAGGATTGAAGAAAGCGAAGAAAGAGA
GAGGTTGCACTGCCAAAGAGCGCATCAGTAAAATGCCTCCATGTGCTGCTGGAAAAAGAAGCTCCATATATCGCGGAGTCACTAGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGCTTGACATGAGCAAAGCGTACAACAGGGTTGAATGGGTGTTCCTCAGGGAGATTATGGCCAAGATTGGTTTTTGTGAGGGATGGATTGCTAAGGTTATGATGTG
CGTGGAATCAGTGGAATTCTCAGTCTTGATCAATGGAGTTCCTCAAGATCCGTTCATGCCGAACAGAGGGCTCAAACAAGGAGACCCCATCTCCCCCTATCTGTTCCTCC
TTTGCGCTGAAGGGCTTTCGGGGCTCCTTAACAGGGAAGAATCCTCTCTCAAATTTAGAGGCATTAGAATAAACAAAATTTGCCATTCCATTTCTCATTTATTCTTTGTT
GACGATAGCTTGGTTTTTTGTAGGGCTGAGGAGAATGATTGCAGTACTATAAAGGGTATTCTAGAGACGTACGAGAAGGCATCAGATCAGGATGCCGAGGAGATTCTAAA
GATCCCGCTGGGAAGCAAAAATGCTATGGACGAAATCATATGGGCTAAGCACCCTAAAGGGTCTTTCTCTGTGAAAACAGCCTACCATCTGGCGGTTAACCAAGAAGAGC
ATCGAAAAGCTAGCTGCTCGGATAATTCCAAGATCCAAGCTATATGGAAGAACTTTTGGAAGATAAAAGCAATCCCCAGAGCCAAAATCTGTGAGCAAATAGGAATCAAA
CACGCATTTGATATGGGAGTGCCTGCGTGGGATGCTCTATTGATTTGGGAAGGTCTAGTGGAAAGACTCGAGGAGGAAGAGATAAGCATCGCCATCCTAATTCTATGGAA
TATTTGGACAACTCGGAACAAGGTGATCAACAATGGCTACAAAGCAGATCAAAATCAAATCAGTAAAATTATTGAAGCCAACATATCCGAGCATTTCAAGGTTAGAGATA
CTAACCTGAGTGCTTCTAGATCGGAGAGTCATCATAGTCAATATTTGTGGTCTCCTCCTCCCCCACCTTTTCTGAAGCTGAATTCCGATGCCTCTTGGAGCAGTTCAGAA
GAAATTGGGGGCTTAGGTTGGACGGTTCGTGACTCCTCAGGGTCTCTGGTATTCGCCGGCAACATGAAGACAAAGAAGAATTGGGAAACGTTGGAGTTGGAAGCTCTAGC
TATGAAGGAAGGGTTGTCTTCTTTCTTGCAAAATAGCGCATGCAAGGATCGCCCTCTTGTGGTTGAATCGGACTCGAGCACTCTGATTAAGTTGATCCAAAATCAAGAGT
CTGATCTCTCTGAGGTAAGAAGCATTGTGGAAGAGATTGGGCTTCTGGCAAGGAGCGCAATAGGAATCTCCTTCGTTTGGTGTTCGAGAAGCTGCAATCGAGCAACCCAC
ACCCTTGCTAAGGATGCGAGATTGGCAGGAACTATAGAGTTTCCTTTGATTTTTGTTGATGTTTCCATGGCTTCTTCGTCCTCTGATCCTGGCTTCAAACACGAAGCCGG
TGCCTGTAGCGCTGCTGCTGCTGGAGGAGGAACGGCGGAGTCTTCCGAGGTTGTAATGGCGAATGATCAGCTTTTACTGTACAGAGGATTGAAGAAAGCGAAGAAAGAGA
GAGGTTGCACTGCCAAAGAGCGCATCAGTAAAATGCCTCCATGTGCTGCTGGAAAAAGAAGCTCCATATATCGCGGAGTCACTAGGTAA
Protein sequenceShow/hide protein sequence
MKLDMSKAYNRVEWVFLREIMAKIGFCEGWIAKVMMCVESVEFSVLINGVPQDPFMPNRGLKQGDPISPYLFLLCAEGLSGLLNREESSLKFRGIRINKICHSISHLFFV
DDSLVFCRAEENDCSTIKGILETYEKASDQDAEEILKIPLGSKNAMDEIIWAKHPKGSFSVKTAYHLAVNQEEHRKASCSDNSKIQAIWKNFWKIKAIPRAKICEQIGIK
HAFDMGVPAWDALLIWEGLVERLEEEEISIAILILWNIWTTRNKVINNGYKADQNQISKIIEANISEHFKVRDTNLSASRSESHHSQYLWSPPPPPFLKLNSDASWSSSE
EIGGLGWTVRDSSGSLVFAGNMKTKKNWETLELEALAMKEGLSSFLQNSACKDRPLVVESDSSTLIKLIQNQESDLSEVRSIVEEIGLLARSAIGISFVWCSRSCNRATH
TLAKDARLAGTIEFPLIFVDVSMASSSSDPGFKHEAGACSAAAAGGGTAESSEVVMANDQLLLYRGLKKAKKERGCTAKERISKMPPCAAGKRSSIYRGVTR