; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0023022 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0023022
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:42994443..42996781
RNA-Seq ExpressionLag0023022
SyntenyLag0023022
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA3462911.1 reverse transcriptase [Gossypium australe]1.4e-3824.68Show/hide
Query:  GVDVVLRSYSKYHVDVVIRSD--EGHWRFTGFY------RDPEVDALGSR------------GDFNELV-SAGRNNVEVRDQKQMQEFREAVDDCGLTDM
        G+ + LRSYSK H+DV+I+ D  +  WRFTGFY          V  L  R            GDFNE++ S  +     RD ++M+ FR+ + +CGL D+
Subjt:  GVDVVLRSYSKYHVDVVIRSD--EGHWRFTGFY------RDPEVDALGSR------------GDFNELV-SAGRNNVEVRDQKQMQEFREAVDDCGLTDM

Query:  AFKG------------------------------------VEHLKFRLSEHCPIRIEVSLK---------------------ASLLEGGVDLCFD--LRR
         + G                                    ++HL F  S+HCP+ +  +                       A +L+   +   +  + +
Subjt:  AFKG------------------------------------VEHLKFRLSEHCPIRIEVSLK---------------------ASLLEGGVDLCFD--LRR

Query:  VSQCARELSSWGKL---KKGNYGRRISEARESL--------------------------------QSRLKRWRGEGSEELRWFHQRATQRRKTNRMEGLF
        +      L  W K    +KG   ++++E  E+L                                Q     W   G     +FH+ AT R+K N +  L 
Subjt:  VSQCARELSSWGKL---KKGNYGRRISEARESL--------------------------------QSRLKRWRGEGSEELRWFHQRATQRRKTNRMEGLF

Query:  DKMGRWVEEEDGMEEIISDYFQELF----------------------------RPYSEEDLLASLKQMGPLKAPGEDDSQRCFI----------------
           G  + +E G++E    YF+ LF                             P++E+++ A+LK MGP+KAPG D     F                 
Subjt:  DKMGRWVEEEDGMEEIISDYFQELF----------------------------RPYSEEDLLASLKQMGPLKAPGEDDSQRCFI----------------

Query:  ------KG------TEVWLGRRDAQRVS--DFRPISLCNVIYKIISKSIANRLKEVLDRIISPSQNAFVPGRNICDNAILGFESLHYMKQKKKGRRAYL-
              KG      T++ L  + +Q  +  +FRPISLC+VIYK+++K+IANRL+ V+   I  +Q+AF+PGR I DN IL +E LH  +QK+ G++ Y+ 
Subjt:  ------KG------TEVWLGRRDAQRVS--DFRPISLCNVIYKIISKSIANRLKEVLDRIISPSQNAFVPGRNICDNAILGFESLHYMKQKKKGRRAYL-

Query:  -----------------------------------------------REARMVLNTLRTYSVMTGQEINYGKSGIYLRPNVRDDVRMRIANLLQVSLVGS
                                                       R AR++   L+ Y+  +GQ +N+ KS ++   N  +  +  +++LL+V +  +
Subjt:  -----------------------------------------------REARMVLNTLRTYSVMTGQEINYGKSGIYLRPNVRDDVRMRIANLLQVSLVGS

Query:  HERYLGLPVGFTGGKMEALK
         E+YLGLP      K EA +
Subjt:  HERYLGLPVGFTGGKMEALK

XP_028068804.1 uncharacterized protein LOC114271378 [Camellia sinensis]1.3e-3929.4Show/hide
Query:  CSGCGESSGIQYLRQLVREENPQLLFFL--------IEDQECK----------------------SEGVDVVLRSYSKYHVDVVIRSDE--GHWRFTGFY
        C G G    ++ L+ L++E+ P ++F +        IE   CK                        GV V +R+YS  HVD V+ SD   G+WRFTGFY
Subjt:  CSGCGESSGIQYLRQLVREENPQLLFFL--------IEDQECK----------------------SEGVDVVLRSYSKYHVDVVIRSDE--GHWRFTGFY

Query:  RDPEV-------DALGSRG-----------DFNE-LVSAGRNNVEVRDQKQMQEFREAVDDCGLTDMAFKG-----------------------------
          PEV       + L   G           DFNE L+ + +  +  R + Q+  F+ A+ DC L D+ F G                             
Subjt:  RDPEV-------DALGSRG-----------DFNE-LVSAGRNNVEVRDQKQMQEFREAVDDCGLTDMAFKG-----------------------------

Query:  ------VEHLKFRLSEHCPIRIEVSLKASLLEGGVDLCFDLRRVSQCARELSSWGKLKKGNYGRRIS---------EARESL--QSRLKRWRGEGSEELR
              V HL    S+H PI ++       L G   L   L  V    R L     L +  YG+R+          E  E+L  Q     W  EG     
Subjt:  ------VEHLKFRLSEHCPIRIEVSLKASLLEGGVDLCFDLRRVSQCARELSSWGKLKKGNYGRRIS---------EARESL--QSRLKRWRGEGSEELR

Query:  WFHQRATQRRKTNRMEGLFDKMGRWVEEEDGMEEIISDYFQELF----------------------------RPYSEEDLLASLKQMGPLKAPGEDDSQR
        +FH +A QR K  R++GL D   RW    + +E I+  YF ELF                            RPY+ E++  +L QM P KAPG D    
Subjt:  WFHQRATQRRKTNRMEGLFDKMGRWVEEEDGMEEIISDYFQELF----------------------------RPYSEEDLLASLKQMGPLKAPGEDDSQR

Query:  CF------IKGTEVW---LG---------------------RRDAQRVSDFRPISLCNVIYKIISKSIANRLKEVLDRIISPSQNAFVPGRNICDNAILG
         F      I G +V    LG                      +  +R+S FRPISLCNV+YK++SK +ANR++++L  IIS +Q+AFV GR I DN +  
Subjt:  CF------IKGTEVW---LG---------------------RRDAQRVSDFRPISLCNVIYKIISKSIANRLKEVLDRIISPSQNAFVPGRNICDNAILG

Query:  FESLHYMKQKKKGRRAY
        FE  H++K K+ G+  +
Subjt:  FESLHYMKQKKKGRRAY

XP_028113864.1 uncharacterized protein LOC114311894 [Camellia sinensis]8.7e-4128.33Show/hide
Query:  VDVVLRSYSKYHVDVVIRSDEG--HWRFTGFYRDPEVDALGS------------------RGDFNELVSA-GRNNVEVRDQKQMQEFREAVDDCGLTDMA
        + + ++SYSK HVD +I ++ G   W+FTGFY +P     G                    GDFNE++ A  ++ +  R Q+QM  FR+ + DC L D+ 
Subjt:  VDVVLRSYSKYHVDVVIRSDEG--HWRFTGFYRDPEVDALGS------------------RGDFNELVSA-GRNNVEVRDQKQMQEFREAVDDCGLTDMA

Query:  FKG-VEHLKFRL-------------SEHCPIRIEVSLKASLLEGGVDLCFDLRRVSQCARELSSWGKLKKGNYGRRISEARESLQSRLKRWRGEGSEELR
        F G + H+  +L             S +  +     L+A  +    DL  +LR V     EL     L+K        E+    Q     W  +      
Subjt:  FKG-VEHLKFRL-------------SEHCPIRIEVSLKASLLEGGVDLCFDLRRVSQCARELSSWGKLKKGNYGRRISEARESLQSRLKRWRGEGSEELR

Query:  WFHQRATQRRKTNRMEGLFDKMGRWVEEEDGMEEIISDYFQELF----------------------------RPYSEEDLLASLKQMGPLKAPGEDDSQR
        +FH +A+QR+K   + GL +  G W  +   +E+I+  YFQ+LF                            RP+  +++  +L QM P+KAPG D +  
Subjt:  WFHQRATQRRKTNRMEGLFDKMGRWVEEEDGMEEIISDYFQELF----------------------------RPYSEEDLLASLKQMGPLKAPGEDDSQR

Query:  CFIKGTEVW-LGRRDAQRV-------------------------------SDFRPISLCNVIYKIISKSIANRLKEVLDRIISPSQNAFVPGRNICDNAI
         F +  + W L   DA R                                S F PISLCNV+YK++SK +ANRLKEVLD ++S +Q+AFVPGR I DN +
Subjt:  CFIKGTEVW-LGRRDAQRV-------------------------------SDFRPISLCNVIYKIISKSIANRLKEVLDRIISPSQNAFVPGRNICDNAI

Query:  LGFESLHYMKQKKKGRRAYLR-----------------------------------------EARMVLNT----LRTYSVMTGQEINYGKSGIYLRPNVR
          FE  HY+K K+ GR  Y                                              ++LNT    LR Y +  GQ +N+ KS I   PNVR
Subjt:  LGFESLHYMKQKKKGRRAYLR-----------------------------------------EARMVLNT----LRTYSVMTGQEINYGKSGIYLRPNVR

Query:  DDVRMRIANLLQVSLVGSHERYLGLP
         + ++ ++ ++ V  + + E+Y G+P
Subjt:  DDVRMRIANLLQVSLVGSHERYLGLP

XP_030924743.1 uncharacterized protein LOC115951733 [Quercus lobata]2.7e-4227.82Show/hide
Query:  CSGCGESSGIQYLRQLVREENPQLLFF---LIEDQECKSEGVDVVLRSYSKYHVDVVIRSDE-GHWRFTGFYRDPE----------VDALGSR-------
        C   G    +Q L  +V+ ++P ++F       ++  K  G+DV + S+  YH+D ++       WRFTGFY +P+          +  L S+       
Subjt:  CSGCGESSGIQYLRQLVREENPQLLFF---LIEDQECKSEGVDVVLRSYSKYHVDVVIRSDE-GHWRFTGFYRDPE----------VDALGSR-------

Query:  -GDFNELVSAGRNNVEV-RDQKQMQEFREAVDDCGLTDMAFKGVE-----------------------------------HLKFRLSEHCPIRIEVSLKA
         GDF EL+        V R    MQ FR+A+D CG  D+ F G+E                                   HL    S+H PI + +   +
Subjt:  -GDFNELVSAGRNNVEV-RDQKQMQEFREAVDDCGLTDMAFKGVE-----------------------------------HLKFRLSEHCPIRIEVSLKA

Query:  SLLEGGVDLCFD--------------------------------LRRVSQCARELSSWGKLKKGNYGRRISEARESLQSRLKRWRGEGSEELRWFHQRAT
           +      F+                                LRR S   RE     +LKK        E +   Q    +W   G +  ++FH  AT
Subjt:  SLLEGGVDLCFD--------------------------------LRRVSQCARELSSWGKLKKGNYGRRISEARESLQSRLKRWRGEGSEELRWFHQRAT

Query:  QRRKTNRMEGLFDKMGRWVEEE-------DGMEEIISDYFQ-ELFRPYSEEDLLASLKQMGPLKAPGEDDSQRCFIKGTEVWLGRRDAQRVSDFRPISLC
        QR++ N ++GL D  G W E E       DG+E ++ +  + +L RP+S E++  ++K+M PLKAPG D                ++ +RVSDFRPISLC
Subjt:  QRRKTNRMEGLFDKMGRWVEEE-------DGMEEIISDYFQ-ELFRPYSEEDLLASLKQMGPLKAPGEDDSQRCFIKGTEVWLGRRDAQRVSDFRPISLC

Query:  NVIYKIISKSIANRLKEVLDRIISPSQNAFVPGRNICDNAILGFESLHYMKQK---KKG-----------------------------------------
        NVIYKIISK IANRLK +L+ IIS +Q+AF+  R I DN ++ FESL++MK     KKG                                         
Subjt:  NVIYKIISKSIANRLKEVLDRIISPSQNAFVPGRNICDNAILGFESLHYMKQK---KKG-----------------------------------------

Query:  --------------------RRAYLREARMVLNTLRTYSVMTGQEINYGKSGIYLRPNVRDDVRMRIANLLQVSLVGSHERYLGLP
                             R+ L E   +   L  Y V +GQ IN  K+ +Y   N     +  I   L V  +  +E+YLGLP
Subjt:  --------------------RRAYLREARMVLNTLRTYSVMTGQEINYGKSGIYLRPNVRDDVRMRIANLLQVSLVGSHERYLGLP

XP_030945781.1 uncharacterized protein LOC115970260 [Quercus lobata]1.0e-4129.4Show/hide
Query:  GDFNELVSAGRNNVEV-RDQKQMQEFREAVDDCGLTDMAFKG-----------------------------------VEHLKFRLSEHCPIRI-------
        GDFNEL+        V R    MQ FR+A+D CG  D+ F G                                   V+HL    S+H PI +       
Subjt:  GDFNELVSAGRNNVEV-RDQKQMQEFREAVDDCGLTDMAFKG-----------------------------------VEHLKFRLSEHCPIRI-------

Query:  ----------------------EVSLKA-SLLEGGVDLCFDLRRVSQCARELSSWGKLKKGNYGRRISEARESLQS---------------RLKR-----
                              EV  +A      G  +     ++ +C + L SW ++  GN  ++I + ++ L                 RLK+     
Subjt:  ----------------------EVSLKA-SLLEGGVDLCFDLRRVSQCARELSSWGKLKKGNYGRRISEARESLQS---------------RLKR-----

Query:  ---------------WRGEGSEELRWFHQRATQRRKTNRMEGLFDKMGRWVEEEDGMEEIISDYFQE-LFRPYSEEDLLASLKQMGPLKAPGEDDSQRCF
                       W   G +  ++FH  ATQR++ N ++GL D  G WV   DG+E ++++   E L RP++ E+L  ++K+M PLKAPG D     F
Subjt:  ---------------WRGEGSEELRWFHQRATQRRKTNRMEGLFDKMGRWVEEEDGMEEIISDYFQE-LFRPYSEEDLLASLKQMGPLKAPGEDDSQRCF

Query:  IK--GTEVWLGRRDA----------------------------QRVSDFRPISLCNVIYKIISKSIANRLKEVLDRIISPSQNAFVPGRNICDNAILGFE
         +   T++ +    A                            +RVSDFRPISLCNVIYKIISK IANRLK +L+ IIS +Q+AF+  R I DN ++ FE
Subjt:  IK--GTEVWLGRRDA----------------------------QRVSDFRPISLCNVIYKIISKSIANRLKEVLDRIISPSQNAFVPGRNICDNAILGFE

Query:  SLHYMKQKKKGRRAY------LREARMVLNTLRTYSVMTGQEINYGKSGIYLRPNVRDDVRMRIANLLQVSLVGSHERYLGLP
        SLH+MK    G++ +      L E   +   L  Y V +GQ IN  K+ +Y   N  +  +  +   L V  +  +E+YLGLP
Subjt:  SLHYMKQKKKGRRAY------LREARMVLNTLRTYSVMTGQEINYGKSGIYLRPNVRDDVRMRIANLLQVSLVGSHERYLGLP

TrEMBL top hitse value%identityAlignment
A0A2N9EVR9 F-box domain-containing protein1.3e-4227.09Show/hide
Query:  CSGCGESSGIQYLRQLVREENPQLLFFL--------IEDQECK----------------------SEGVDVVLRSYSKYHVDVVI-RSDEGHWRFTGFYR
        C G G    +Q L +LV  ++P ++F +        +E   C+                       + V++ + S+S  H+D V+  + E  WRFTGFY 
Subjt:  CSGCGESSGIQYLRQLVREENPQLLFFL--------IEDQECK----------------------SEGVDVVLRSYSKYHVDVVI-RSDEGHWRFTGFYR

Query:  DPE----------VDALGSR--------GDFNELVS-AGRNNVEVRDQKQMQEFREAVDDCGLTDMAFKG------------------------------
         PE          +  L S+        GDFNELV    +     R ++QMQ FR+ +D+CG  D+ F G                              
Subjt:  DPE----------VDALGSR--------GDFNELVS-AGRNNVEVRDQKQMQEFREAVDDCGLTDMAFKG------------------------------

Query:  -----VEHLKFRLSEHCPIRIEV------SLKASLLE---------------------GGVDLCFDLRRVSQCARELSSWGKLKKGNYGRRISEARESL-
             V HL  R S+H P+ + +      S K    E                      GV +     ++  C REL  W K   GN   +I E  + L 
Subjt:  -----VEHLKFRLSEHCPIRIEV------SLKASLLE---------------------GGVDLCFDLRRVSQCARELSSWGKLKKGNYGRRISEARESL-

Query:  ----------------------------QSRLKR------WRGEGSEELRWFHQRATQRRKTNRMEGLFDKMGRWVEE--EDGMEEIISDYFQELFRPYS
                                    + RL R      W   G    R+ H RATQR++ N +  L +  G W  +  +  M  +     ++L R ++
Subjt:  ----------------------------QSRLKR------WRGEGSEELRWFHQRATQRRKTNRMEGLFDKMGRWVEE--EDGMEEIISDYFQELFRPYS

Query:  EEDLLASLKQMGPLKAPGEDDSQRCF------IKGTEVW--------LGR----------------RDAQRVSDFRPISLCNVIYKIISKSIANRLKEVL
          +++ +LKQM PLKAPG D     F      + G EV          G+                ++ + V DFRPISLCNVIYK+ISK +ANRLK +L
Subjt:  EEDLLASLKQMGPLKAPGEDDSQRCF------IKGTEVW--------LGR----------------RDAQRVSDFRPISLCNVIYKIISKSIANRLKEVL

Query:  DRIISPSQNAFVPGRNICDNAILGFESLHYMKQKKKGR-----------RAYLR---------------EARMVLNTL------RTYSVMTGQEINYGKS
          I+S SQ+ FVPGR I DN ++ FE+LH+M+ +K  R           +AY R                ++   +TL        Y   +GQ+IN  K+
Subjt:  DRIISPSQNAFVPGRNICDNAILGFESLHYMKQKKKGR-----------RAYLR---------------EARMVLNTL------RTYSVMTGQEINYGKS

Query:  GIYLRPNVRDDVRMRIANLLQVSLVGSHERYLGLP
         I+   +  D  ++ I N+L V  +  +ERYLGLP
Subjt:  GIYLRPNVRDDVRMRIANLLQVSLVGSHERYLGLP

A0A2N9F204 Uncharacterized protein2.7e-4828.41Show/hide
Query:  YRRTLISRVAVSNENNMLECSGCGESSGIQYLRQLVREENPQLLFFL--------IEDQECK----------------------SEGVDVVLRSYSKYHV
        YRR L    A S+E   +E S   E +  Q L +LV  ++P ++F +        +E   CK                          ++ + S+S  H+
Subjt:  YRRTLISRVAVSNENNMLECSGCGESSGIQYLRQLVREENPQLLFFL--------IEDQECK----------------------SEGVDVVLRSYSKYHV

Query:  DVVIRSDEGH-WRFTGFYRDPEVD----------ALGSR--------GDFNELV----SAGRNNVEVRDQKQMQEFREAVDDCGLTDMAFKGVEH--LKF
        D V+  +  + WRFTGFY  PE             L S+        GDFNELV      GR+N   R ++QMQ FR+ +DDCG  D+ F G +      
Subjt:  DVVIRSDEGH-WRFTGFYRDPEVD----------ALGSR--------GDFNELV----SAGRNNVEVRDQKQMQEFREAVDDCGLTDMAFKGVEH--LKF

Query:  RLSEHCPIRIEVSL----------KASLLEGGVDLCFDL------------------RRVSQCARELSSWGKLKKGNYGRRISEARESL-----------
        R+ +    R++ ++           A +L  G   C +                    ++  C REL SW K   GN   +I E    L           
Subjt:  RLSEHCPIRIEVSL----------KASLLEGGVDLCFDL------------------RRVSQCARELSSWGKLKKGNYGRRISEARESL-----------

Query:  ------------------QSRLKR------WRGEGSEELRWFHQRATQRRKTNRMEGLFDKMGRWVEE--------EDGMEEIISDYFQELFRPYSEEDL
                          + RL R      W   G    R+FH RATQR++ N +  L +  GRW           E  M  +  +   +L R +S  ++
Subjt:  ------------------QSRLKR------WRGEGSEELRWFHQRATQRRKTNRMEGLFDKMGRWVEE--------EDGMEEIISDYFQELFRPYSEEDL

Query:  LASLKQMGPLKAPGEDDSQRCFIK------GTEVW--------LGR----------------RDAQRVSDFRPISLCNVIYKIISKSIANRLKEVLDRII
        +A+LKQM PLKAPG D     F +      G EV          G+                ++ + V DFRPISLCNVIYKIISK +ANRLK +L +I+
Subjt:  LASLKQMGPLKAPGEDDSQRCFIK------GTEVW--------LGR----------------RDAQRVSDFRPISLCNVIYKIISKSIANRLKEVLDRII

Query:  SPSQNAFVPGRNICDNAILGFESLHYMKQKKKGR--------------------------------------RAYLREARMVLNTLRTYSVMTGQEINYG
        S SQ+AFVPGR I DN ++ FE+LH+M+ +KKGR                                      +A   +   +   L  Y   +GQ+IN  
Subjt:  SPSQNAFVPGRNICDNAILGFESLHYMKQKKKGR--------------------------------------RAYLREARMVLNTLRTYSVMTGQEINYG

Query:  KSGIYLRPNVRDDVRMRIANLLQVSLVGSHERYLGLP
        K+ I+   +     +  + N+L V  +  +ERYLGLP
Subjt:  KSGIYLRPNVRDDVRMRIANLLQVSLVGSHERYLGLP

A0A2N9FBV3 Reverse transcriptase domain-containing protein6.5e-4228.8Show/hide
Query:  LECSGCGESSGIQYLRQLVREENPQLLFFLIEDQECKS------------------------------EGVDVVLRSYSKYHVDVVIR-SDEGHWRFTGF
        L C G      +  LR LV++E PQ++F      + KS                              +  D+ + SYS+ H+D V+R SD+  WRFTGF
Subjt:  LECSGCGESSGIQYLRQLVREENPQLLFFLIEDQECKS------------------------------EGVDVVLRSYSKYHVDVVIR-SDEGHWRFTGF

Query:  YRDPE----VDALG--------------SRGDFNELVSAGRN-NVEVRDQKQMQEFREAVDDCGLTDMAFKG----------------------------
        Y +PE    +D+                 RGDFNE+V +G    +  R Q QM+ FR A+ DC L+D+ F+G                            
Subjt:  YRDPE----VDALG--------------SRGDFNELVSAGRN-NVEVRDQKQMQEFREAVDDCGLTDMAFKG----------------------------

Query:  --------VEHLKFRLSEHCPIRIEVSLK---------------ASLLEGGVDLCFDLRRVSQCAREL----SSWGKLKKGNYGRRISEARESLQSRLKR
                V +L+F  S+H  + ++   +               A +   G   C D+ R +  +           K K  N+     E     +SR+  
Subjt:  --------VEHLKFRLSEHCPIRIEVSLK---------------ASLLEGGVDLCFDLRRVSQCAREL----SSWGKLKKGNYGRRISEARESLQSRLKR

Query:  WRGEGSEELRWFHQRATQRRKTNRMEGLFDKMGRWVEEEDGMEEIISDYFQELF----------------------------RPYSEEDLLASLKQMGPL
        W  EG    R+FH  ATQR++ NR+  L ++ G WV +ED +  +  D+F+ LF                            R +   ++  +L Q+ P 
Subjt:  WRGEGSEELRWFHQRATQRRKTNRMEGLFDKMGRWVEEEDGMEEIISDYFQELF----------------------------RPYSEEDLLASLKQMGPL

Query:  KAPGEDDSQRCFIKGTEVWL--GRRDAQRVSDFRPISLCNVIYKIISKSIANRLKEVLDRIISPSQNAFVPGRNICDNAILGFESLHYMKQKKKGRRAYL
        KA G D      +  T++ L    +  + +  FRPI+LCNVIYK ISK +ANRLK +L  IIS SQ+ FVPGR I DN ++ FE+LHYMK K++G+  ++
Subjt:  KAPGEDDSQRCFIKGTEVWL--GRRDAQRVSDFRPISLCNVIYKIISKSIANRLKEVLDRIISPSQNAFVPGRNICDNAILGFESLHYMKQKKKGRRAYL

A0A2N9HLP3 Uncharacterized protein4.1e-4425.77Show/hide
Query:  CSGCGESSGIQYLRQLVREENPQLLFFL--------IEDQECK----------------------SEGVDVVLRSYSKYHVDVVIRSDEGH-WRFTGFYR
        C G G    +Q L +LV  ++P ++F +        +E   CK                         +++ + S+S  H+D V+  +  + WRFTGFY 
Subjt:  CSGCGESSGIQYLRQLVREENPQLLFFL--------IEDQECK----------------------SEGVDVVLRSYSKYHVDVVIRSDEGH-WRFTGFYR

Query:  DPEVD----------ALGSR--------GDFNELV----SAGRNNVEVRDQKQMQEFREAVDDCGLTDMAFKG---------------------------
         PE             L S+        GDFNELV      GR++   R ++QMQ FR+ +DDCG  D+ F G                           
Subjt:  DPEVD----------ALGSR--------GDFNELV----SAGRNNVEVRDQKQMQEFREAVDDCGLTDMAFKG---------------------------

Query:  --------VEHLKFRLSEHCPIRIEVS-LKASLLE--------------------------GGVDLCFDLRRVSQCARELSSWGKLKKGNYGRRISEARE
                V HL  R S+H P+ +  + +K  + +                           GV +     ++ +C R+L SW K   GN   +I E   
Subjt:  --------VEHLKFRLSEHCPIRIEVS-LKASLLE--------------------------GGVDLCFDLRRVSQCARELSSWGKLKKGNYGRRISEARE

Query:  SL-----------------------------QSRLKR------WRGEGSEELRWFHQRATQRRKTNRMEGLFDKMGRWVEEEDGMEEIISDYFQELF---
         L                             + RL R      W   G    R+FH RATQR++ N +  L +  GRW      +  +  D++  LF   
Subjt:  SL-----------------------------QSRLKR------WRGEGSEELRWFHQRATQRRKTNRMEGLFDKMGRWVEEEDGMEEIISDYFQELF---

Query:  -------------------------RPYSEEDLLASLKQMGPLKAPGEDDSQRCFIKG---------TEVWL-----GR----------------RDAQR
                                 R ++  +++A+LKQM PLKAPG D     F +          TEV L     G+                ++ + 
Subjt:  -------------------------RPYSEEDLLASLKQMGPLKAPGEDDSQRCFIKG---------TEVWL-----GR----------------RDAQR

Query:  VSDFRPISLCNVIYKIISKSIANRLKEVLDRIISPSQNAFVPGRNICDNAILGFESLHYMKQKKKGR---------------------------------
        V DFRPI LCNVIYKIISK +ANRLK +L +IIS SQ+AFVPGR I DN ++ FE+LH+M+ +KKGR                                 
Subjt:  VSDFRPISLCNVIYKIISKSIANRLKEVLDRIISPSQNAFVPGRNICDNAILGFESLHYMKQKKKGR---------------------------------

Query:  ------------------RAYLREARMVLNTLRTYSVMTGQEINYGKSGIYLRPNVRDDVRMRIANLLQVSLVGSHERYLGLP
                          +A + +   +   L  Y   +GQ+IN  K+ I+   +     +  + N+L V  +  +ERYLGLP
Subjt:  ------------------RAYLREARMVLNTLRTYSVMTGQEINYGKSGIYLRPNVRDDVRMRIANLLQVSLVGSHERYLGLP

A0A2N9IZB6 Reverse transcriptase domain-containing protein2.7e-4029.75Show/hide
Query:  LECSGCGESSGIQYLRQLVREENPQLLFFL---------------IEDQEC---------------KSEGVDVVLRSYSKYHVDVVIRSDEGH-WRFTGF
        L C G G    +  L  LVR++ P ++F +               +  Q C                   V V ++SYS YH+D  +  ++G  WR TGF
Subjt:  LECSGCGESSGIQYLRQLVREENPQLLFFL---------------IEDQEC---------------KSEGVDVVLRSYSKYHVDVVIRSDEGH-WRFTGF

Query:  YRDPEV-------------DALGSR-----GDFNELVSAGRN-NVEVRDQKQMQEFREAVDDCGLTDMAFKG----------------------------
        Y  PEV              AL +       DFNE +S       E R   QM  F+EA+ D  L D+ FKG                            
Subjt:  YRDPEV-------------DALGSR-----GDFNELVSAGRN-NVEVRDQKQMQEFREAVDDCGLTDMAFKG----------------------------

Query:  --------VEHLKFRLSEHCPIRIEV---SLKASLL--EGGVDLCFDLRRVSQCARELSSWGKLKKGNYGRRISEARESLQSRLKRWRGEGSEELRWFHQ
                V H+    S+H  + +E+     +A LL  E    + +D + V+   +E++S   L K     R        Q     W  +G +   +FH+
Subjt:  --------VEHLKFRLSEHCPIRIEV---SLKASLL--EGGVDLCFDLRRVSQCARELSSWGKLKKGNYGRRISEARESLQSRLKRWRGEGSEELRWFHQ

Query:  RATQRRKTNRMEGLFDKMGRWVEEEDGMEEIISDYFQELF----------------------------RPYSEEDLLASLKQMGPLKAPGEDDSQRCFIK
         A QR++TN ++GL D+   W  +   +E+I + YF  LF                            RP+S +++  +L QM P KAPG D      +K
Subjt:  RATQRRKTNRMEGLFDKMGRWVEEEDGMEEIISDYFQELF----------------------------RPYSEEDLLASLKQMGPLKAPGEDDSQRCFIK

Query:  GTEVWLGRRDAQRVSDFRPISLCNVIYKIISKSIANRLKEVLDRIISPSQNAFVPGRNICDNAILGFESLHYMKQKKKGRRAYL
          E          +S FRPISLCNVIYKIISK + NR+K+VL R+IS SQ AFVPGR I DN I+ FE++H++K  + G+ A L
Subjt:  GTEVWLGRRDAQRVSDFRPISLCNVIYKIISKSIANRLKEVLDRIISPSQNAFVPGRNICDNAILGFESLHYMKQKKKGRRAYL

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein6.8e-0441.67Show/hide
Query:  RDAQRVSDFRPISLCNVIYKIISKSIANRLKEVLDRIISPSQNAFVPG
        RD  +  +FRPISL N+  KI++K +ANR+++ + ++I   Q  F+PG
Subjt:  RDAQRVSDFRPISLCNVIYKIISKSIANRLKEVLDRIISPSQNAFVPG

P11369 LINE-1 retrotransposable element ORF2 protein5.0e-0731.36Show/hide
Query:  DYFQELFRPYSEEDLLASLKQMGPLKAPGEDDSQRCFIKGTEVWLGR--RDAQRVSDFRPISLCNVIYKIISKSIANRLKEVLDRIISPSQNAFVPGRNI
        D F   F    +EDL+  L ++   K   E      F + T   + +  +D  ++ +FRPISL N+  KI++K +ANR++E +  II P Q  F+PG   
Subjt:  DYFQELFRPYSEEDLLASLKQMGPLKAPGEDDSQRCFIKGTEVWLGR--RDAQRVSDFRPISLCNVIYKIISKSIANRLKEVLDRIISPSQNAFVPGRNI

Query:  CDNAILGFESLHYMKQKK
          N       +HY+ + K
Subjt:  CDNAILGFESLHYMKQKK

P14381 Transposon TX1 uncharacterized 149 kDa protein4.5e-0842.42Show/hide
Query:  DAQRVSDFRPISLCNVIYKIISKSIANRLKEVLDRIISPSQNAFVPGRNICDNAILGFESLHYMKQ
        D + + ++RP+SL +  YKI++K+I+ RLK VL  +I P Q+  VPGR I DN  L  + LH+ ++
Subjt:  DAQRVSDFRPISLCNVIYKIISKSIANRLKEVLDRIISPSQNAFVPGRNICDNAILGFESLHYMKQ

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGGAGGAGAGATGTATCGGCGAACCCTGATCTCCAGGGTCGCCGTCAGCAATGAAAATAATATGCTGGAATGTTCGGGGTGTGGGGAATCCTCAGGCATTCAATA
CTTGCGCCAGTTGGTGCGGGAAGAGAATCCCCAACTCTTGTTCTTTCTAATCGAAGATCAAGAGTGCAAAAGCGAAGGGGTGGATGTGGTTTTGCGATCCTATAGCAAAT
ACCATGTCGATGTTGTGATTAGATCTGATGAGGGCCATTGGAGGTTTACCGGGTTCTATAGAGACCCAGAAGTGGATGCCTTGGGTTCTAGGGGTGATTTCAATGAGTTA
GTGAGTGCAGGGAGAAACAATGTGGAAGTGAGGGATCAGAAGCAGATGCAGGAATTCAGAGAGGCTGTTGATGACTGTGGTCTGACTGATATGGCATTTAAAGGTGTGGA
GCATTTGAAGTTTAGATTGTCTGAACATTGCCCTATTCGAATTGAGGTGAGCCTTAAGGCGAGCCTGTTAGAAGGGGGAGTAGACCTCTGTTTCGATTTGAGGAGGGTGA
GTCAGTGTGCCAGAGAATTGTCTAGTTGGGGCAAGTTGAAGAAGGGGAACTATGGCAGGAGGATTTCTGAGGCTCGAGAGAGTCTCCAGTCGAGGTTGAAGAGGTGGAGA
GGTGAGGGGAGCGAGGAACTCAGATGGTTTCATCAACGAGCCACTCAACGACGGAAGACTAATAGGATGGAAGGCCTGTTTGATAAGATGGGCAGATGGGTGGAGGAGGA
GGATGGGATGGAGGAGATTATTTCTGATTATTTCCAGGAGTTGTTCAGGCCATATTCTGAAGAGGATCTCCTGGCTTCCCTTAAGCAGATGGGTCCATTGAAGGCCCCTG
GGGAGGATGATTCCCAGCGTTGTTTTATCAAAGGTACTGAGGTGTGGTTGGGAAGGAGAGATGCCCAGAGGGTTTCAGATTTCAGGCCCATTAGTTTGTGCAATGTGATT
TATAAAATCATATCGAAGTCCATTGCTAACAGGCTCAAGGAAGTGTTGGATCGTATAATCTCGCCATCACAGAATGCCTTTGTGCCTGGCAGGAATATTTGTGACAATGC
CATTTTGGGGTTTGAAAGCCTCCATTACATGAAGCAGAAGAAAAAGGGAAGACGGGCATATCTAAGGGAGGCAAGAATGGTGCTAAACACGCTGAGGACTTATTCAGTAA
TGACTGGGCAGGAGATTAACTATGGTAAGTCCGGGATCTATCTCAGACCAAATGTTAGAGATGATGTTCGAATGAGAATTGCTAACCTTCTTCAGGTGTCTTTGGTGGGA
TCTCATGAGCGTTATTTAGGCTTGCCAGTGGGTTTTACTGGTGGGAAAATGGAGGCCCTGAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAGGAGGAGAGATGTATCGGCGAACCCTGATCTCCAGGGTCGCCGTCAGCAATGAAAATAATATGCTGGAATGTTCGGGGTGTGGGGAATCCTCAGGCATTCAATA
CTTGCGCCAGTTGGTGCGGGAAGAGAATCCCCAACTCTTGTTCTTTCTAATCGAAGATCAAGAGTGCAAAAGCGAAGGGGTGGATGTGGTTTTGCGATCCTATAGCAAAT
ACCATGTCGATGTTGTGATTAGATCTGATGAGGGCCATTGGAGGTTTACCGGGTTCTATAGAGACCCAGAAGTGGATGCCTTGGGTTCTAGGGGTGATTTCAATGAGTTA
GTGAGTGCAGGGAGAAACAATGTGGAAGTGAGGGATCAGAAGCAGATGCAGGAATTCAGAGAGGCTGTTGATGACTGTGGTCTGACTGATATGGCATTTAAAGGTGTGGA
GCATTTGAAGTTTAGATTGTCTGAACATTGCCCTATTCGAATTGAGGTGAGCCTTAAGGCGAGCCTGTTAGAAGGGGGAGTAGACCTCTGTTTCGATTTGAGGAGGGTGA
GTCAGTGTGCCAGAGAATTGTCTAGTTGGGGCAAGTTGAAGAAGGGGAACTATGGCAGGAGGATTTCTGAGGCTCGAGAGAGTCTCCAGTCGAGGTTGAAGAGGTGGAGA
GGTGAGGGGAGCGAGGAACTCAGATGGTTTCATCAACGAGCCACTCAACGACGGAAGACTAATAGGATGGAAGGCCTGTTTGATAAGATGGGCAGATGGGTGGAGGAGGA
GGATGGGATGGAGGAGATTATTTCTGATTATTTCCAGGAGTTGTTCAGGCCATATTCTGAAGAGGATCTCCTGGCTTCCCTTAAGCAGATGGGTCCATTGAAGGCCCCTG
GGGAGGATGATTCCCAGCGTTGTTTTATCAAAGGTACTGAGGTGTGGTTGGGAAGGAGAGATGCCCAGAGGGTTTCAGATTTCAGGCCCATTAGTTTGTGCAATGTGATT
TATAAAATCATATCGAAGTCCATTGCTAACAGGCTCAAGGAAGTGTTGGATCGTATAATCTCGCCATCACAGAATGCCTTTGTGCCTGGCAGGAATATTTGTGACAATGC
CATTTTGGGGTTTGAAAGCCTCCATTACATGAAGCAGAAGAAAAAGGGAAGACGGGCATATCTAAGGGAGGCAAGAATGGTGCTAAACACGCTGAGGACTTATTCAGTAA
TGACTGGGCAGGAGATTAACTATGGTAAGTCCGGGATCTATCTCAGACCAAATGTTAGAGATGATGTTCGAATGAGAATTGCTAACCTTCTTCAGGTGTCTTTGGTGGGA
TCTCATGAGCGTTATTTAGGCTTGCCAGTGGGTTTTACTGGTGGGAAAATGGAGGCCCTGAAGTAG
Protein sequenceShow/hide protein sequence
MGGGEMYRRTLISRVAVSNENNMLECSGCGESSGIQYLRQLVREENPQLLFFLIEDQECKSEGVDVVLRSYSKYHVDVVIRSDEGHWRFTGFYRDPEVDALGSRGDFNEL
VSAGRNNVEVRDQKQMQEFREAVDDCGLTDMAFKGVEHLKFRLSEHCPIRIEVSLKASLLEGGVDLCFDLRRVSQCARELSSWGKLKKGNYGRRISEARESLQSRLKRWR
GEGSEELRWFHQRATQRRKTNRMEGLFDKMGRWVEEEDGMEEIISDYFQELFRPYSEEDLLASLKQMGPLKAPGEDDSQRCFIKGTEVWLGRRDAQRVSDFRPISLCNVI
YKIISKSIANRLKEVLDRIISPSQNAFVPGRNICDNAILGFESLHYMKQKKKGRRAYLREARMVLNTLRTYSVMTGQEINYGKSGIYLRPNVRDDVRMRIANLLQVSLVG
SHERYLGLPVGFTGGKMEALK