CuGenDBv2

Lag0028441 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0028441
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr8:21800401..21801714
RNA-Seq Expression: Lag0028441
Synteny: Lag0028441
Gene Ontology terms: NA
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
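
The genome location field can be split into a chromosome name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention, but not stated on this page); `Location` and `parse_location` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'chrom:start..end' interval (hypothetical helper type)."""
    chrom: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints are counted.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'chr8:21800401..21801714'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("chr8:21800401..21801714")
print(loc.chrom, loc.start, loc.end, loc.length)
# prints: chr8 21800401 21801714 1314
```

Under the inclusive-coordinate assumption, the gene spans 1314 bp on chr8.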


Homology
GenBank top hits | e value | % identity | Alignment
XP_023896927.1 uncharacterized protein LOC112008817 [Quercus suber] | e value: 4.5e-71 | % identity: 35.55
Query:  METKANQRRMFSLKDALSFQNVFAVDCTGLSGGLALFWHSSITLNIVSYSKVHIDTIIENDG--KTFRFIGFYGEPKPENRHLSWTFLRRLSDFPHQAWV
        MET+  +  +  L+  L + N+F V    L GGLAL W + + L+I ++S  HID +I N G    +RF GFYG P+  NR  SW+ LR L       WV
Subjt:  METKANQRRMFSLKDALSFQNVFAVDCTGLSGGLALFWHSSITLNIVSYSKVHIDTIIENDG--KTFRFIGFYGEPKPENRHLSWTFLRRLSDFPHQAWV

Query:  VAGDFNSITKENEKDGGSAYDSKESQNFLSAINDCALKDLGFSGNRFTWLNNQPGQACIRKRLDRCIGNVELLALYNFVVINLLGFLSSYHRVIELYLQE
          GDFN IT+ +EK GG+    K+ Q+F   ++ C LKDLG+SG  FTW N +  +A +  RLDR +  V+ +  +    ++ L   SS H+ + L   +
Subjt:  VAGDFNSITKENEKDGGSAYDSKESQNFLSAINDCALKDLGFSGNRFTWLNNQPGQACIRKRLDRCIGNVELLALYNFVVINLLGFLSSYHRVIELYLQE

Query:  QRGSFHRKFTPLRFEEAWTLHEDCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRREALTEDRMDVRSLVCLKMEEKHL
         +  F+R   P RFE  W   E C   +   WDK +  +  + +L +++    +LK W +N  G  +  +  +++       D M  R+   +K   + +
Subjt:  QRGSFHRKFTPLRFEEAWTLHEDCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRREALTEDRMDVRSLVCLKMEEKHL

Query:  DHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKASHRRQKNKIQKIVDSNGNCHENREDIERHFLHYYSDMFKSSPLNVQAIEEILQATTARLTDAMNAH
          ++  EE  W QRS+ +WL++GDQNTK+FH +A+ R +KN I  + D+ G   E  E I      YYS++F  + +N   ++ +L     R++D+MN  
Subjt:  DHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKASHRRQKNKIQKIVDSNGNCHENREDIERHFLHYYSDMFKSSPLNVQAIEEILQATTARLTDAMNAH

Query:  LDRVFTTSEVEEALKQMHPTKAPGPDGLPALFYQKY
        L + F   EV  ALKQM P  APGPDGLP LFY+ +
Subjt:  LDRVFTTSEVEEALKQMHPTKAPGPDGLPALFYQKY

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina] | e value: 4.8e-73 | % identity: 38.14
Query:  LKDALSFQNVFAVDCTGLSGGLALFWHSSITLNIVSYSKVHIDTIIEN-DGKTFRFIGFYGEPKPENRHLSWTFLRRLSDFPHQAWVVAGDFNSITKENE
        +   L + + F V   GL GGLAL W++ + +NI+SYS  HID +I   DGK +R  G YG P+   +  +WT LRRL+      W+  GDFN I   NE
Subjt:  LKDALSFQNVFAVDCTGLSGGLALFWHSSITLNIVSYSKVHIDTIIEN-DGKTFRFIGFYGEPKPENRHLSWTFLRRLSDFPHQAWVVAGDFNSITKENE

Query:  KDGGSAYDSKESQNFLSAINDCALKDLGFSGNRFTWLNNQPGQACIRKRLDRCIGNVEL-LALYNFVVINLLGFLSSYHRVIELYLQEQRGSFHRKFTPL
        K GG   +      F   +NDC L DLG  G  FTW N + G   I +RLDR +G+ +   + YN +V NL  + S +  V+    +  + ++++K +  
Subjt:  KDGGSAYDSKESQNFLSAINDCALKDLGFSGNRFTWLNNQPGQACIRKRLDRCIGNVEL-LALYNFVVINLLGFLSSYHRVIELYLQEQRGSFHRKFTPL

Query:  R--FEEAWTLHEDCASKIQQGWDKGAGDNSPTNLLTRIKSVSE----ELKRWGRNKTGKFKSRIAESKRRREALTEDRMDVRSLVCLKMEEKHLDHILLE
        R  +E+ W+ +E C + +++ W KG G  S  + +T  K  S     +L+ W RN+    K ++ + K++   +  +      +  +K  E+ ++ +LL+
Subjt:  R--FEEAWTLHEDCASKIQQGWDKGAGDNSPTNLLTRIKSVSE----ELKRWGRNKTGKFKSRIAESKRRREALTEDRMDVRSLVCLKMEEKHLDHILLE

Query:  EEIYWKQRSREDWLKWGDQNTKWFHSKASHRRQKNKIQKIVDSNGNCHENREDIERHFLHYYSDMFKSSPLNVQAIEEILQATTARLTDAMNAHLDRVFT
        EE+YWKQRSR DWLK GD+NTK+FHSKAS R++KN+I  ++D +    ++RE +E+ F  Y++ +F +S  + + IE  L     R+T  MN  LD  FT
Subjt:  EEIYWKQRSREDWLKWGDQNTKWFHSKASHRRQKNKIQKIVDSNGNCHENREDIERHFLHYYSDMFKSSPLNVQAIEEILQATTARLTDAMNAHLDRVFT

Query:  TSEVEEALKQMHPTKAPGPDGLPALFYQKY
          EV  AL QM PTKAPGPDGLPA F+QK+
Subjt:  TSEVEEALKQMHPTKAPGPDGLPALFYQKY

XP_025703475.1 uncharacterized protein LOC112805291 [Arachis hypogaea] | e value: 2.2e-73 | % identity: 36.77
Query:  MFLMETKANQRRMFSLKDALSFQNVFAVDCTG----LSGGLALFWHSSITLNIVSYSKVHID---TIIENDGKTFRFIGFYGEPKPENRHLSWTFLRRLS
        +FLMET+  +  M  ++      N+ AVDC G     +GGLA+ W ++I + + S S  HID    ++EN+ + +R  GFYG+P+ +N+H+SW  L+ L 
Subjt:  MFLMETKANQRRMFSLKDALSFQNVFAVDCTG----LSGGLALFWHSSITLNIVSYSKVHID---TIIENDGKTFRFIGFYGEPKPENRHLSWTFLRRLS

Query:  DFPHQAWVVAGDFNSITKENEKDGGSAYDSKESQNFLSAINDCALKDLGFSGNRFTWLNNQPGQACIRKRLDRCIGNVELLALYNFVVINLLGFLSSYHR
           + AW+V GDFN I  + EK GG+     + Q F  A+    L DLGF G+ FTW N Q G+  I++RLDR +  +E    +   ++  L    S H 
Subjt:  DFPHQAWVVAGDFNSITKENEKDGGSAYDSKESQNFLSAINDCALKDLGFSGNRFTWLNNQPGQACIRKRLDRCIGNVELLALYNFVVINLLGFLSSYHR

Query:  VIELYLQEQRGSFHRKFTPLRFEEAWTLHEDCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRREALTEDRMDVRSLVC
         I + +  ++    +     RFEE W  +E+C   +++ W    G+  P  +  RI    +EL RWG    G    +I + + + + L   + +   ++ 
Subjt:  VIELYLQEQRGSFHRKFTPLRFEEAWTLHEDCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRREALTEDRMDVRSLVC

Query:  LKMEEKHLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKASHRRQKNKIQKIVDSNGNCHENREDIERHFLHYYSDMFKSSPLNVQAIEEILQATTA-
        +K  E  LD  L EE+ +W QRSR +WL+ GDQN+++FH KAS R+ +N ++ I D  G  HE  E IE   + YY ++F+S     + +EE  QA    
Subjt:  LKMEEKHLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKASHRRQKNKIQKIVDSNGNCHENREDIERHFLHYYSDMFKSSPLNVQAIEEILQATTA-

Query:  --RLTDAMNAHLDRVFTTSEVEEALKQMHPTKAPGPDGLPALFYQK
          R+       LD+ FT  EV +ALKQMHPTKAPGPDG+PALFYQK
Subjt:  --RLTDAMNAHLDRVFTTSEVEEALKQMHPTKAPGPDGLPALFYQK

XP_030923017.1 uncharacterized protein LOC115949892 [Quercus lobata] | e value: 1.8e-72 | % identity: 35.54
Query:  MFLMETKANQRRMFSLKDALSFQNVFAVDCTGLSGGLALFWHSSITLNIVSYSKVHIDTIIENDG--KTFRFIGFYGEPKPENRHLSWTFLRRLSDFPHQ
        +FLMET+  +  +  L+  L F N+F V    L GGLAL W++ + L+I ++S  HID ++ N G    +RF GFYG P+  NR  SW+ LR L      
Subjt:  MFLMETKANQRRMFSLKDALSFQNVFAVDCTGLSGGLALFWHSSITLNIVSYSKVHIDTIIENDG--KTFRFIGFYGEPKPENRHLSWTFLRRLSDFPHQ

Query:  AWVVAGDFNSITKENEKDGGSAYDSKESQNFLSAINDCALKDLGFSGNRFTWLNNQPGQACIRKRLDRCIGNVELLALYNFVVINLLGFLSSYHRVIELY
         WV  GDFN IT+ +EK GG+    K+ Q+F   ++ C LKDLGFSG  FTW N +  +A +  RLDR +   + +  +    ++ L   SS H+ + L 
Subjt:  AWVVAGDFNSITKENEKDGGSAYDSKESQNFLSAINDCALKDLGFSGNRFTWLNNQPGQACIRKRLDRCIGNVELLALYNFVVINLLGFLSSYHRVIELY

Query:  LQEQRGSFHRKFTPLRFEEAWTLHEDCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRREALTEDRMDVRSLVCLKMEE
          + +  F R   P RFE  W   E C   +   WD+  G +  + ++++++    +LK W +N  G  +  +A +++       D M  ++   +K+  
Subjt:  LQEQRGSFHRKFTPLRFEEAWTLHEDCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRREALTEDRMDVRSLVCLKMEE

Query:  KHLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKASHRRQKNKIQKIVDSNGNCHENREDIERHFLHYYSDMFKSSPLNVQAIEEILQATTARLTDAM
        + +  ++  EE  W QRS+ +WL++GDQNTK+FH +A+ R +KN I  + +++G   E  E I      YYS++F  S +N   +E +L+    R+++AM
Subjt:  KHLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKASHRRQKNKIQKIVDSNGNCHENREDIERHFLHYYSDMFKSSPLNVQAIEEILQATTARLTDAM

Query:  NAHLDRVFTTSEVEEALKQMHPTKAPGPDGLPALFYQKY
        N+ L + F   EV  ALKQM P  APGPDG P LFY+ +
Subjt:  NAHLDRVFTTSEVEEALKQMHPTKAPGPDGLPALFYQKY

XP_030932272.1 uncharacterized protein LOC115958047 [Quercus lobata] | e value: 3.1e-72 | % identity: 36.96
Query:  MFLMETKANQRRMFSLKDALSFQNVFAVDCTGLSGGLALFWHSSITLNIVSYSKVHIDTIIEND--GKTFRFIGFYGEPKPENRHLSWTFLRRLSDFPHQ
        +FLMETK     M  +K+ L       V C G SGGLAL W   + ++I S+S  HID I++    GK +R  GFYG P    R  SW  L RLS     
Subjt:  MFLMETKANQRRMFSLKDALSFQNVFAVDCTGLSGGLALFWHSSITLNIVSYSKVHIDTIIEND--GKTFRFIGFYGEPKPENRHLSWTFLRRLSDFPHQ

Query:  AWVVAGDFNSITKENEKDGGSAYDSKESQNFLSAINDCALKDLGFSGNRFTWLNNQPGQACIRKRLDRCIGNVELLALYNFVVINLLGFLSSYHRVIELY
         WV  GDFN +   +EK+GG    +K+ + F  AIN  +L+DLG++G  FTW      +  I++RLDR + +    A +  + +      SS H ++ L 
Subjt:  AWVVAGDFNSITKENEKDGGSAYDSKESQNFLSAINDCALKDLGFSGNRFTWLNNQPGQACIRKRLDRCIGNVELLALYNFVVINLLGFLSSYHRVIELY

Query:  LQEQRGSFHRKFTPLRFEEAWTLHEDCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRREALTEDRMDVRSLVCLKME-
          + +    R+    RFEE W   EDC   + + WD+G    S   L + ++     L  W  N  G    ++A  + R E L  + M   S+V  ++E 
Subjt:  LQEQRGSFHRKFTPLRFEEAWTLHEDCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRREALTEDRMDVRSLVCLKME-

Query:  -EKHLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKASHRRQKNKIQKIVDSNGNCHENREDIERHFLHYYSDMFKSSPLNVQAIEEILQATTARLTD
            ++++L  EEI W+QRSR +WLK GD+NT +FH+KAS R  +N I  ++D++G        IE  F+ Y+  +F +S  N    EE++QA   ++T+
Subjt:  -EKHLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKASHRRQKNKIQKIVDSNGNCHENREDIERHFLHYYSDMFKSSPLNVQAIEEILQATTARLTD

Query:  AMNAHLDRVFTTSEVEEALKQMHPTKAPGPDGLPALFYQKY
         MNA L R F  SE++ ALK M+PT APGPDG+P +FYQK+
Subjt:  AMNAHLDRVFTTSEVEEALKQMHPTKAPGPDGLPALFYQKY

TrEMBL top hits | e value | % identity | Alignment
A0A2N9HU09 Reverse transcriptase domain-containing protein | e value: 5.7e-72 | % identity: 36.14
Query:  MFLMETKANQRRMFSLKDALSFQNVFAVDCTGLSGGLALFWHSSITLNIVSYSKVHIDTIIE-NDGKTFRFIGFYGEPKPENRHLSWTFLRRLSDFPHQA
        +FLMETK + R M S++  L F+  F+V C G SGGLAL W+    + I ++S+ H+D+ ++  +G  +RF GFYG P+   +  SW  L +L    +  
Subjt:  MFLMETKANQRRMFSLKDALSFQNVFAVDCTGLSGGLALFWHSSITLNIVSYSKVHIDTIIE-NDGKTFRFIGFYGEPKPENRHLSWTFLRRLSDFPHQA

Query:  WVVAGDFNSITKENEKDGGSAYDSKESQNFLSAINDCALKDLGFSGNRFTWLNNQPGQACIRKRLDRCIGNVELLALYNFVVINLLGFLSSYHRVIELYL
        W+  GDFN I   +E+ G     ++  Q+F   +N C L DLGF G  FTW N + G+A I+KRLDR + N   L  +N   ++ +    S H  + L++
Subjt:  WVVAGDFNSITKENEKDGGSAYDSKESQNFLSAINDCALKDLGFSGNRFTWLNNQPGQACIRKRLDRCIGNVELLALYNFVVINLLGFLSSYHRVIELYL

Query:  QEQRGSFHRKFTPLRFEEAWTLHEDCASKIQQGWDKGAGDNSPTNLL-TRIKSVSEELKRWGRNKTGKFKSRIAESKRRREALTEDRMDVRSLVCLKMEE
                 K  P +FEE W+LH +C + IQ  W +     SP  +L  +IK   E L RW ++ +G+F+++I        +L       ++   +   +
Subjt:  QEQRGSFHRKFTPLRFEEAWTLHEDCASKIQQGWDKGAGDNSPTNLL-TRIKSVSEELKRWGRNKTGKFKSRIAESKRRREALTEDRMDVRSLVCLKMEE

Query:  KHLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKASHRRQKNKIQKIVDSNGNCHENREDIERHFLHYYSDMF-KSSPLNVQAIEEILQATTARLTDA
          ++ +LL EE++W+QRSR  WL  GD NTK+FHS+A+ RR+ N +  + +S+     +   IE   + Y+ D+F  S+P+N   +E+ L A  +R+T  
Subjt:  KHLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKASHRRQKNKIQKIVDSNGNCHENREDIERHFLHYYSDMF-KSSPLNVQAIEEILQATTARLTDA

Query:  MNAHLDRVFTTSEVEEALKQMHPTKAPGPDGLPALFYQKY
         N  L + FT  EV  AL QMHP+KAPGPDG+ + F+QKY
Subjt:  MNAHLDRVFTTSEVEEALKQMHPTKAPGPDGLPALFYQKY

A0A2N9I475 Reverse transcriptase domain-containing protein | e value: 5.7e-72 | % identity: 36.14
Query:  MFLMETKANQRRMFSLKDALSFQNVFAVDCTGLSGGLALFWHSSITLNIVSYSKVHIDTIIE-NDGKTFRFIGFYGEPKPENRHLSWTFLRRLSDFPHQA
        +FLMETK + R M S++  L F+  F+V C G SGGLAL W+    + I ++S+ H+D+ ++  +G  +RF GFYG P+   +  SW  L +L    +  
Subjt:  MFLMETKANQRRMFSLKDALSFQNVFAVDCTGLSGGLALFWHSSITLNIVSYSKVHIDTIIE-NDGKTFRFIGFYGEPKPENRHLSWTFLRRLSDFPHQA

Query:  WVVAGDFNSITKENEKDGGSAYDSKESQNFLSAINDCALKDLGFSGNRFTWLNNQPGQACIRKRLDRCIGNVELLALYNFVVINLLGFLSSYHRVIELYL
        W+  GDFN I   +E+ G     ++  Q+F   +N C L DLGF G  FTW N + G+A I+KRLDR + N   L  +N   ++ +    S H  + L++
Subjt:  WVVAGDFNSITKENEKDGGSAYDSKESQNFLSAINDCALKDLGFSGNRFTWLNNQPGQACIRKRLDRCIGNVELLALYNFVVINLLGFLSSYHRVIELYL

Query:  QEQRGSFHRKFTPLRFEEAWTLHEDCASKIQQGWDKGAGDNSPTNLL-TRIKSVSEELKRWGRNKTGKFKSRIAESKRRREALTEDRMDVRSLVCLKMEE
                 K  P +FEE W+LH +C + IQ  W +     SP  +L  +IK   E L RW ++ +G+F+++I        +L       ++   +   +
Subjt:  QEQRGSFHRKFTPLRFEEAWTLHEDCASKIQQGWDKGAGDNSPTNLL-TRIKSVSEELKRWGRNKTGKFKSRIAESKRRREALTEDRMDVRSLVCLKMEE

Query:  KHLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKASHRRQKNKIQKIVDSNGNCHENREDIERHFLHYYSDMF-KSSPLNVQAIEEILQATTARLTDA
          ++ +LL EE++W+QRSR  WL  GD NTK+FHS+A+ RR+ N +  + +S+     +   IE   + Y+ D+F  S+P+N   +E+ L A  +R+T  
Subjt:  KHLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKASHRRQKNKIQKIVDSNGNCHENREDIERHFLHYYSDMF-KSSPLNVQAIEEILQATTARLTDA

Query:  MNAHLDRVFTTSEVEEALKQMHPTKAPGPDGLPALFYQKY
         N  L + FT  EV  AL QMHP+KAPGPDG+ + F+QKY
Subjt:  MNAHLDRVFTTSEVEEALKQMHPTKAPGPDGLPALFYQKY

A0A2N9ISW4 Reverse transcriptase domain-containing protein | e value: 5.7e-72 | % identity: 36.14
Query:  MFLMETKANQRRMFSLKDALSFQNVFAVDCTGLSGGLALFWHSSITLNIVSYSKVHIDTIIE-NDGKTFRFIGFYGEPKPENRHLSWTFLRRLSDFPHQA
        +FLMETK + R M S++  L F+  F+V C G SGGLAL W+    + I ++S+ H+D+ ++  +G  +RF GFYG P+   +  SW  L +L    +  
Subjt:  MFLMETKANQRRMFSLKDALSFQNVFAVDCTGLSGGLALFWHSSITLNIVSYSKVHIDTIIE-NDGKTFRFIGFYGEPKPENRHLSWTFLRRLSDFPHQA

Query:  WVVAGDFNSITKENEKDGGSAYDSKESQNFLSAINDCALKDLGFSGNRFTWLNNQPGQACIRKRLDRCIGNVELLALYNFVVINLLGFLSSYHRVIELYL
        W+  GDFN I   +E+ G     ++  Q+F   +N C L DLGF G  FTW N + G+A I+KRLDR + N   L  +N   ++ +    S H  + L++
Subjt:  WVVAGDFNSITKENEKDGGSAYDSKESQNFLSAINDCALKDLGFSGNRFTWLNNQPGQACIRKRLDRCIGNVELLALYNFVVINLLGFLSSYHRVIELYL

Query:  QEQRGSFHRKFTPLRFEEAWTLHEDCASKIQQGWDKGAGDNSPTNLL-TRIKSVSEELKRWGRNKTGKFKSRIAESKRRREALTEDRMDVRSLVCLKMEE
                 K  P +FEE W+LH +C + IQ  W +     SP  +L  +IK   E L RW ++ +G+F+++I        +L       ++   +   +
Subjt:  QEQRGSFHRKFTPLRFEEAWTLHEDCASKIQQGWDKGAGDNSPTNLL-TRIKSVSEELKRWGRNKTGKFKSRIAESKRRREALTEDRMDVRSLVCLKMEE

Query:  KHLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKASHRRQKNKIQKIVDSNGNCHENREDIERHFLHYYSDMF-KSSPLNVQAIEEILQATTARLTDA
          ++ +LL EE++W+QRSR  WL  GD NTK+FHS+A+ RR+ N +  + +S+     +   IE   + Y+ D+F  S+P+N   +E+ L A  +R+T  
Subjt:  KHLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKASHRRQKNKIQKIVDSNGNCHENREDIERHFLHYYSDMF-KSSPLNVQAIEEILQATTARLTDA

Query:  MNAHLDRVFTTSEVEEALKQMHPTKAPGPDGLPALFYQKY
         N  L + FT  EV  AL QMHP+KAPGPDG+ + F+QKY
Subjt:  MNAHLDRVFTTSEVEEALKQMHPTKAPGPDGLPALFYQKY

A0A2N9IT57 Reverse transcriptase domain-containing protein | e value: 5.7e-72 | % identity: 36.14
Query:  MFLMETKANQRRMFSLKDALSFQNVFAVDCTGLSGGLALFWHSSITLNIVSYSKVHIDTIIE-NDGKTFRFIGFYGEPKPENRHLSWTFLRRLSDFPHQA
        +FLMETK + R M S++  L F+  F+V C G SGGLAL W+    + I ++S+ H+D+ ++  +G  +RF GFYG P+   +  SW  L +L    +  
Subjt:  MFLMETKANQRRMFSLKDALSFQNVFAVDCTGLSGGLALFWHSSITLNIVSYSKVHIDTIIE-NDGKTFRFIGFYGEPKPENRHLSWTFLRRLSDFPHQA

Query:  WVVAGDFNSITKENEKDGGSAYDSKESQNFLSAINDCALKDLGFSGNRFTWLNNQPGQACIRKRLDRCIGNVELLALYNFVVINLLGFLSSYHRVIELYL
        W+  GDFN I   +E+ G     ++  Q+F   +N C L DLGF G  FTW N + G+A I+KRLDR + N   L  +N   ++ +    S H  + L++
Subjt:  WVVAGDFNSITKENEKDGGSAYDSKESQNFLSAINDCALKDLGFSGNRFTWLNNQPGQACIRKRLDRCIGNVELLALYNFVVINLLGFLSSYHRVIELYL

Query:  QEQRGSFHRKFTPLRFEEAWTLHEDCASKIQQGWDKGAGDNSPTNLL-TRIKSVSEELKRWGRNKTGKFKSRIAESKRRREALTEDRMDVRSLVCLKMEE
                 K  P +FEE W+LH +C + IQ  W +     SP  +L  +IK   E L RW ++ +G+F+++I        +L       ++   +   +
Subjt:  QEQRGSFHRKFTPLRFEEAWTLHEDCASKIQQGWDKGAGDNSPTNLL-TRIKSVSEELKRWGRNKTGKFKSRIAESKRRREALTEDRMDVRSLVCLKMEE

Query:  KHLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKASHRRQKNKIQKIVDSNGNCHENREDIERHFLHYYSDMF-KSSPLNVQAIEEILQATTARLTDA
          ++ +LL EE++W+QRSR  WL  GD NTK+FHS+A+ RR+ N +  + +S+     +   IE   + Y+ D+F  S+P+N   +E+ L A  +R+T  
Subjt:  KHLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKASHRRQKNKIQKIVDSNGNCHENREDIERHFLHYYSDMF-KSSPLNVQAIEEILQATTARLTDA

Query:  MNAHLDRVFTTSEVEEALKQMHPTKAPGPDGLPALFYQKY
         N  L + FT  EV  AL QMHP+KAPGPDG+ + F+QKY
Subjt:  MNAHLDRVFTTSEVEEALKQMHPTKAPGPDGLPALFYQKY

A0A7N2R0C3 Reverse transcriptase domain-containing protein | e value: 7.5e-72 | % identity: 37.56
Query:  MFLMETKANQRRMFSLKDALSFQNVFAVDCTGLSGGLALFWHSSITLNIVSYSKVHIDTII--ENDGKTFRFIGFYGEPKPENRHLSWTFLRRLSDFPHQ
        +FL ETKA+QRR+  L+  L      AV   G SGGLA+ W   + +++ S S  HID ++   N    +R  GFYG P    R +SW  L  LS   + 
Subjt:  MFLMETKANQRRMFSLKDALSFQNVFAVDCTGLSGGLALFWHSSITLNIVSYSKVHIDTII--ENDGKTFRFIGFYGEPKPENRHLSWTFLRRLSDFPHQ

Query:  AWVVAGDFNSITKENEKDGGSAYDSKESQNFLSAINDCALKDLGFSGNRFTWLNNQPGQACIRKRLDRCIGNVELLALYNFVVINLLGFLSSYHRVIELY
         WVV GDFN I   +EK G    D+++ + F   +++C L DLGF G RFTW N + G+     RLDR + N E + L+    +      +S H ++ L 
Subjt:  AWVVAGDFNSITKENEKDGGSAYDSKESQNFLSAINDCALKDLGFSGNRFTWLNNQPGQACIRKRLDRCIGNVELLALYNFVVINLLGFLSSYHRVIELY

Query:  L--QEQRGSFHRKFTPLRFEEAWTLHEDCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRREALTEDRMDVRSLVCLKM
        +  +E R    R+F    FEE WT  E C   I++ WD   G N    +  R+K    +L+ W R   G     + + + R + L E  +   S   ++ 
Subjt:  L--QEQRGSFHRKFTPLRFEEAWTLHEDCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRREALTEDRMDVRSLVCLKM

Query:  EEKHLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKASHRRQKNKIQKIVDSNGNCHENREDIERHFLHYYSDMFKSS-PLNVQAIEEILQATTARLT
         +K ++ ++L EEI W QRSR  W+K+GD+NT++FH+ A++RR+KNKI+ I+DS G   EN E++E   L Y+ +++ S+ P    A    L A   R+T
Subjt:  EEKHLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKASHRRQKNKIQKIVDSNGNCHENREDIERHFLHYYSDMFKSS-PLNVQAIEEILQATTARLT

Query:  DAMNAHLDRVFTTSEVEEALKQMHPTKAPGPDGLPALFYQKY
        + MN  L R F   EV +AL QMHPTK+PGPDG+  +F+QKY
Subjt:  DAMNAHLDRVFTTSEVEEALKQMHPTKAPGPDGLPALFYQKY

SwissProt top hits | e value | % identity | Alignment
O00370 LINE-1 retrotransposable element ORF2 protein | e value: 4.4e-05 | % identity: 22.97
Query:  RLDRCIGNVELLALYNFVVINLLGFLSSYHRVIELYLQ----EQRGSFHRKFTPLRFEEAWTLHEDCASKIQQGWDKGAG-DNSPTNLLTRIKSVSEELK
        ++D  +G+  LL+      I +  +LS  H  I+L L+     Q  S   K   L   + W +H +  ++I+  ++     D +  NL    K+V     
Subjt:  RLDRCIGNVELLALYNFVVINLLGFLSSYHRVIELYLQ----EQRGSFHRKFTPLRFEEAWTLHEDCASKIQQGWDKGAG-DNSPTNLLTRIKSVSEELK

Query:  RWGRNKTGKFKSRIAESKRRREALTEDRMDVRSLVCLKMEEKHLDHILLEE----EIYWKQRSREDWLKWGDQNTKWFHSKAS-----------HRRQKN
               GKF + +   KR++E    D +  +     K E+ H      +E        K+   +  L+  +++  WF  + +            +R+KN
Subjt:  RWGRNKTGKFKSRIAESKRRREALTEDRMDVRSLVCLKMEEKHLDHILLEE----EIYWKQRSREDWLKWGDQNTKWFHSKAS-----------HRRQKN

Query:  KIQKIVDSNGNCHENREDIERHFLHYYSDMFKSSPLNVQAIEEILQA-TTARLTDAMNAHLDRVFTTSEVEEALKQMHPTKAPGPDGLPALFYQKY
        +I  I +  G+   +  +I+     YY  ++ +   N++ ++  L   T  RL       L+R  T SE+   +  +   K+PGPDG  A FYQ+Y
Subjt:  KIQKIVDSNGNCHENREDIERHFLHYYSDMFKSSPLNVQAIEEILQA-TTARLTDAMNAHLDRVFTTSEVEEALKQMHPTKAPGPDGLPALFYQKY

P14381 Transposon TX1 uncharacterized 149 kDa protein | e value: 3.4e-05 | % identity: 20.71
Query:  VFAVDCTGLSGGLALFWHSSITLNIVSYSKV----HIDTIIENDGKTFRFIGFYGEPKPENRHLSWTFLRRLSDF-----PHQAWVVAGDFNSITKENEK
        VF    T  S G+   +  S    ++S + V     +   +   G+T+  +  Y       R     F   LS +       +A ++ GDFN      ++
Subjt:  VFAVDCTGLSGGLALFWHSSITLNIVSYSKV----HIDTIIENDGKTFRFIGFYGEPKPENRHLSWTFLRRLSDF-----PHQAWVVAGDFNSITKENEK

Query:  DGGSAYDSKESQNFLSAINDCALKDLGFSGN----RFTWLNNQPGQACIRKRLDRCIGNVELLALYNFVVINLLGFLSSYHRVIELYLQ-----EQRGSF
        +     DS ES      I   +L D+    N     FT++  + G    + R+DR   +  L++      I L  F  S H  + L +       +   +
Subjt:  DGGSAYDSKESQNFLSAINDCALKDLGFSGN----RFTWLNNQPGQACIRKRLDRCIGNVELLALYNFVVINLLGFLSSYHRVIELYLQ-----EQRGSF

Query:  HRKFTPLRFE----------EAWTLHEDCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRREALTEDRMDV--------
        H   + L  E            W   +D  + + Q WD G            +K + +E   + ++ +G+  + I       EAL  + +D+        
Subjt:  HRKFTPLRFE----------EAWTLHEDCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRREALTEDRMDV--------

Query:  -RSLVCLKMEEKHLDHILLEEEIYWK-QRSREDWLKWGDQNTKWFHSKASHRRQKNKIQKIVDSNGNCHENREDIERHFLHYYSDMFKSSPLNVQAIEEI
         ++L C  +E K     + + +      RSR   L   D+ +++F++    +  + +I  +   +G   E+ E I      +Y ++F   P++  A EE+
Subjt:  -RSLVCLKMEEKHLDHILLEEEIYWK-QRSREDWLKWGDQNTKWFHSKASHRRQKNKIQKIVDSNGNCHENREDIERHFLHYYSDMFKSSPLNVQAIEEI

Query:  LQATTARLTDAMNAHLDRVFTTSEVEEALKQMHPTKAPGPDGLPALFYQ
               +++     L+   T  E+ +AL+ M   K+PG DGL   F+Q
Subjt:  LQATTARLTDAMNAHLDRVFTTSEVEEALKQMHPTKAPGPDGLPALFYQ

Arabidopsis top hits | e value | % identity | Alignment
AT1G43760.1 DNAse I-like superfamily protein | e value: 1.9e-11 | % identity: 23.05
Query:  QAWVVAGDFNSI--TKENEKDGGSAYDSKESQNFLSAINDCALKDLGFSGNRFTWLNNQPGQACIRKRLDRCIGNVELLALY--NFVVINLLGFLSSYHR
        Q  ++ GDF+ I  T ++     ++   +  + F + + D  L D+   G  +TW N+Q     IRK LDR I N +  + +     V  L G   S H 
Subjt:  QAWVVAGDFNSI--TKENEKDGGSAYDSKESQNFLSAINDCALKDLGFSGNRFTWLNNQPGQACIRKRLDRCIGNVELLALY--NFVVINLLGFLSSYHR

Query:  VIELYLQEQRGSFHRKFTPLRFEEAWTLHEDCASKIQQGWDKGAGDNSPT-NLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRREALTEDRMDVRSLV
           + L+       + F   R+    + H      +   W++     S   +L   +K+  +  K   R   G  + +  E+    E++    +   S  
Subjt:  VIELYLQEQRGSFHRKFTPLRFEEAWTLHEDCASKIQQGWDKGAGDNSPT-NLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRREALTEDRMDVRSLV

Query:  CLKME---EKHLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKASHRRQKNKIQKIVDSNGNCHENREDIERHFLHYYSDMFKSSP--LNVQAIEEIL
          ++E    K  +      E +++Q+SR  WL+ GD NT++FH      + KN I+ +   +    EN   ++   + YY+ +  S    L   +++ I 
Subjt:  CLKME---EKHLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKASHRRQKNKIQKIVDSNGNCHENREDIERHFLHYYSDMFKSSP--LNVQAIEEIL

Query:  QATTARLTDAMNAHLDRVFTTSEVEEALKQMHPTKAPGPDGLPALFY
             R  D + + L  + +  E+  A+  M   KAPGPD   A F+
Subjt:  QATTARLTDAMNAHLDRVFTTSEVEEALKQMHPTKAPGPDGLPALFY


Sequences
CDS sequence
ATGTTCTTGATGGAAACCAAGGCGAATCAAAGGCGAATGTTTTCTCTCAAGGATGCTCTTTCTTTTCAAAATGTTTTTGCAGTTGATTGTACTGGACTTAGTGGTGGTTT
GGCTTTATTTTGGCATTCTTCAATTACTCTTAATATTGTTTCTTATTCTAAAGTGCACATTGATACTATTATTGAGAATGATGGAAAAACGTTCAGGTTTATAGGCTTCT
ATGGCGAACCTAAGCCAGAAAATCGTCACCTATCTTGGACATTTCTTAGACGTCTTTCTGATTTCCCCCACCAGGCTTGGGTGGTAGCGGGAGATTTTAACTCTATTACC
AAAGAGAATGAAAAAGATGGCGGAAGTGCATATGATAGTAAAGAGAGTCAAAATTTCTTAAGCGCCATAAATGATTGTGCGTTGAAAGATCTTGGATTTTCAGGCAACCG
CTTCACATGGTTAAACAATCAACCTGGACAAGCTTGTATCAGGAAACGTTTGGACAGATGTATTGGTAACGTTGAACTTTTGGCTTTATATAATTTTGTTGTTATTAATC
TTCTTGGTTTTTTGTCCTCATATCACAGGGTCATTGAGCTTTACCTCCAAGAACAGAGAGGCTCGTTCCACAGGAAATTCACTCCTTTAAGATTCGAAGAGGCTTGGACC
CTCCACGAAGATTGTGCCTCCAAAATCCAACAGGGGTGGGATAAAGGAGCAGGGGACAATTCTCCTACGAATTTACTAACCCGTATCAAAAGTGTGTCTGAGGAACTCAA
GAGGTGGGGCAGAAACAAAACAGGAAAATTTAAGTCACGAATTGCAGAATCTAAAAGGAGAAGAGAGGCGTTGACGGAAGATAGGATGGATGTGAGGTCCTTGGTATGCC
TAAAGATGGAAGAAAAGCACCTAGATCATATCCTGCTGGAGGAGGAAATCTATTGGAAACAACGTTCTAGGGAGGATTGGCTAAAGTGGGGGGATCAAAATACAAAATGG
TTCCACTCTAAAGCTAGTCACCGTCGACAGAAAAATAAAATTCAAAAGATAGTTGACTCCAATGGAAATTGCCACGAAAACAGGGAAGATATTGAACGTCATTTCCTTCA
TTACTATTCTGATATGTTTAAATCTTCTCCTCTAAATGTGCAGGCAATCGAAGAAATTCTCCAAGCTACCACGGCTAGGTTGACAGATGCCATGAATGCACACTTAGATC
GGGTCTTCACAACGTCGGAAGTGGAGGAGGCCCTCAAGCAGATGCACCCAACAAAAGCCCCCGGTCCAGATGGGCTACCAGCCCTATTTTATCAAAAATACTAG
mRNA sequence
ATGTTCTTGATGGAAACCAAGGCGAATCAAAGGCGAATGTTTTCTCTCAAGGATGCTCTTTCTTTTCAAAATGTTTTTGCAGTTGATTGTACTGGACTTAGTGGTGGTTT
GGCTTTATTTTGGCATTCTTCAATTACTCTTAATATTGTTTCTTATTCTAAAGTGCACATTGATACTATTATTGAGAATGATGGAAAAACGTTCAGGTTTATAGGCTTCT
ATGGCGAACCTAAGCCAGAAAATCGTCACCTATCTTGGACATTTCTTAGACGTCTTTCTGATTTCCCCCACCAGGCTTGGGTGGTAGCGGGAGATTTTAACTCTATTACC
AAAGAGAATGAAAAAGATGGCGGAAGTGCATATGATAGTAAAGAGAGTCAAAATTTCTTAAGCGCCATAAATGATTGTGCGTTGAAAGATCTTGGATTTTCAGGCAACCG
CTTCACATGGTTAAACAATCAACCTGGACAAGCTTGTATCAGGAAACGTTTGGACAGATGTATTGGTAACGTTGAACTTTTGGCTTTATATAATTTTGTTGTTATTAATC
TTCTTGGTTTTTTGTCCTCATATCACAGGGTCATTGAGCTTTACCTCCAAGAACAGAGAGGCTCGTTCCACAGGAAATTCACTCCTTTAAGATTCGAAGAGGCTTGGACC
CTCCACGAAGATTGTGCCTCCAAAATCCAACAGGGGTGGGATAAAGGAGCAGGGGACAATTCTCCTACGAATTTACTAACCCGTATCAAAAGTGTGTCTGAGGAACTCAA
GAGGTGGGGCAGAAACAAAACAGGAAAATTTAAGTCACGAATTGCAGAATCTAAAAGGAGAAGAGAGGCGTTGACGGAAGATAGGATGGATGTGAGGTCCTTGGTATGCC
TAAAGATGGAAGAAAAGCACCTAGATCATATCCTGCTGGAGGAGGAAATCTATTGGAAACAACGTTCTAGGGAGGATTGGCTAAAGTGGGGGGATCAAAATACAAAATGG
TTCCACTCTAAAGCTAGTCACCGTCGACAGAAAAATAAAATTCAAAAGATAGTTGACTCCAATGGAAATTGCCACGAAAACAGGGAAGATATTGAACGTCATTTCCTTCA
TTACTATTCTGATATGTTTAAATCTTCTCCTCTAAATGTGCAGGCAATCGAAGAAATTCTCCAAGCTACCACGGCTAGGTTGACAGATGCCATGAATGCACACTTAGATC
GGGTCTTCACAACGTCGGAAGTGGAGGAGGCCCTCAAGCAGATGCACCCAACAAAAGCCCCCGGTCCAGATGGGCTACCAGCCCTATTTTATCAAAAATACTAG
Protein sequence
MFLMETKANQRRMFSLKDALSFQNVFAVDCTGLSGGLALFWHSSITLNIVSYSKVHIDTIIENDGKTFRFIGFYGEPKPENRHLSWTFLRRLSDFPHQAWVVAGDFNSIT
KENEKDGGSAYDSKESQNFLSAINDCALKDLGFSGNRFTWLNNQPGQACIRKRLDRCIGNVELLALYNFVVINLLGFLSSYHRVIELYLQEQRGSFHRKFTPLRFEEAWT
LHEDCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRREALTEDRMDVRSLVCLKMEEKHLDHILLEEEIYWKQRSREDWLKWGDQNTKW
FHSKASHRRQKNKIQKIVDSNGNCHENREDIERHFLHYYSDMFKSSPLNVQAIEEILQATTARLTDAMNAHLDRVFTTSEVEEALKQMHPTKAPGPDGLPALFYQKY
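
The CDS and protein sequences listed above are related by codon translation. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper written for illustration, not the method the database used. It is demonstrated only on the first six codons and the last five codons of the CDS.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks from wrapped input
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if codon not in CODON_TABLE:
            raise ValueError(f"unrecognised codon {codon!r} at offset {i}")
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 18 nt and last 15 nt of the CDS sequence on this page.
print(translate("ATGTTCTTGATGGAAACC"))  # prints: MFLMET
print(translate("TATCAAAAATACTAG"))     # prints: YQKY
```

Both fragments agree with the corresponding ends of the protein sequence (`MFLMET...` and `...YQKY`), which is consistent with the CDS ending in a TAG stop codon.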