; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025975 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025975
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr10:25927265..25928625
RNA-Seq ExpressionLag0025975
SyntenyLag0025975
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]2.2e-7743.05Show/hide
Query:  CKKL---------SKFEEVWTKFSSTRDIIRKVWE----EDQRVRVVGWEAKTGRCIKELLAWNKERLNGSIKATISKKEKEIEILTK---------DGS
        CKKL           +E++W+ + +  +I+R  WE          V  ++    R +  L  W+KE   G       +K+K+ E++ +            
Subjt:  CKKL---------SKFEEVWTKFSSTRDIIRKVWE----EDQRVRVVGWEAKTGRCIKELLAWNKERLNGSIKATISKKEKEIEILTK---------DGS

Query:  DWDKIRKSERELEELLEEEERYWKQRSREDWLNWGDRNTKWFHAKAFQRRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDS
        D ++IRK E ++  +L +EE YWKQRSR DWL  GD+NTK+FH+KA  RRR+NKI G+ D  G+WV+  E +      +F  LF SS P     +  L  
Subjt:  DWDKIRKSERELEELLEEEERYWKQRSREDWLNWGDRNTKWFHAKAFQRRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDS

Query:  INPRVSEKQNRDLARSFSKAEIDREIKDMKPSKAPGMDGLQALFYQNYWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKVHSPKKMEDFRPISLCN
        + P+VS++ N  L   F+  +I R + +M P+KAPG DGL A F+Q +W +VG      CL ILN Q +L+SLN T IA IPKV  P+K+ +FRPISLCN
Subjt:  INPRVSEKQNRDLARSFSKAEIDREIKDMKPSKAPGMDGLQALFYQNYWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKVHSPKKMEDFRPISLCN

Query:  VSYKIIAKVLANKLKGILDSIISPSQSAFVPGRQILDNVAIGFECIHAINSRRKGQKGLMAMKLDMSKAYDRVE
        V Y+I+AK +AN+LK IL+ IISP+QSAF+P R I DNV IG+EC+H I   +  + GL+A+KLD+SKAYDRVE
Subjt:  VSYKIIAKVLANKLKGILDSIISPSQSAFVPGRQILDNVAIGFECIHAINSRRKGQKGLMAMKLDMSKAYDRVE

XP_023874626.1 uncharacterized protein LOC111987155 [Quercus suber]1.5e-7845.85Show/hide
Query:  FEEVWTKFSSTRDIIRKVWEEDQRVRVVGWEAKTGRCIKELLAWNKERLNGSIKATISKKEKEIEILTKDGSDWDK---IRKSERELEELLEEEERYWKQ
        FE +WT+    R++I + W+  +R        +   C + L  WN+  + G++   + +K+ +++ L       DK   I+  ++E+ E L +EE  WKQ
Subjt:  FEEVWTKFSSTRDIIRKVWEEDQRVRVVGWEAKTGRCIKELLAWNKERLNGSIKATISKKEKEIEILTKDGSDWDK---IRKSERELEELLEEEERYWKQ

Query:  RSREDWLNWGDRNTKWFHAKAFQRRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDSINPRVSEKQNRDLARSFSKAEIDRE
        RSR  WL  GDRNTK+FHA A QRRR+N+I G+ +  G WVE  E + R+ LDYF+ +FKS  P   D  + L +I+ RVSE+ N DL   F   E+   
Subjt:  RSREDWLNWGDRNTKWFHAKAFQRRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDSINPRVSEKQNRDLARSFSKAEIDRE

Query:  IKDMKPSKAPGMDGLQALFYQNYWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKVHSPKKMEDFRPISLCNVSYKIIAKVLANKLKGILDSIISPS
        ++ M P+KAPG DG+  +FYQ YW++V  +     L +LN       +N T I  IPKVHSP+K+ +FRPISLCNV YKII+KVLAN+LKG+L  +I  S
Subjt:  IKDMKPSKAPGMDGLQALFYQNYWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKVHSPKKMEDFRPISLCNVSYKIIAKVLANKLKGILDSIISPS

Query:  QSAFVPGRQILDNVAIGFECIHAINSRRKGQKGLMAMKLDMSKAYDRVE
        QSAFVPGR I+DNV + FE +H I  R+KG++ LMA+KLDMSKAYDRVE
Subjt:  QSAFVPGRQILDNVAIGFECIHAINSRRKGQKGLMAMKLDMSKAYDRVE

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]2.1e-8045.3Show/hide
Query:  KFEEVWTKFSSTRDIIRKVWEEDQRVRVV-GWEAKTGRCIKELLAWNKERLNGSIKATISKKEKEIEILT---KDGSDWDKIRKSERELEELLEEEERYW
        +FE +WT+    +DII+ VW     V    G  A+   C + L  WNK  + G+I   I +K++ +  L    ++GS   +I    +E+ ELL+ EE  W
Subjt:  KFEEVWTKFSSTRDIIRKVWEEDQRVRVV-GWEAKTGRCIKELLAWNKERLNGSIKATISKKEKEIEILT---KDGSDWDKIRKSERELEELLEEEERYW

Query:  KQRSREDWLNWGDRNTKWFHAKAFQRRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDSINPRVSEKQNRDLARSFSKAEID
        +QRSR  WL  GDRNTK+FH KA  RRRRN I+GI D NG+W +  E + +VA+ YF  ++ SS+  P   + +LD+I   V+E+ N  L + F++ EI+
Subjt:  KQRSREDWLNWGDRNTKWFHAKAFQRRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDSINPRVSEKQNRDLARSFSKAEID

Query:  REIKDMKPSKAPGMDGLQALFYQNYWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKVHSPKKMEDFRPISLCNVSYKIIAKVLANKLKGILDSIIS
          +  M P+KAPG DG+ A+F+Q YW++VGN+   + L +LN   S+  +N+T I  +PK+ +P KM DFRPISLCNV YK+I+KVLAN+LK IL  IIS
Subjt:  REIKDMKPSKAPGMDGLQALFYQNYWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKVHSPKKMEDFRPISLCNVSYKIIAKVLANKLKGILDSIIS

Query:  PSQSAFVPGRQILDNVAIGFECIHAINSRRKGQKGLMAMKLDMSKAYDRVE
         +QSAF+ GR I DNV + FE +H +  +++G++G  A+KLDMSKAYDRVE
Subjt:  PSQSAFVPGRQILDNVAIGFECIHAINSRRKGQKGLMAMKLDMSKAYDRVE

XP_023901742.1 uncharacterized protein LOC112013579 [Quercus suber]1.0e-7440.85Show/hide
Query:  KKLSKFEEVWTKFSSTRDIIRKVWEEDQ-RVRVVGWEAKTGRCIKELLAWNKERLNGSIKATISKKEKEIEILT---KDGSDWDKIRKSERELEELLEEE
        K+   FE +WTK     D+I   W      V   G  +   RC   L++WN + + G+I   I +K + +  +T   + G+    I +  +EL +LL+ E
Subjt:  KKLSKFEEVWTKFSSTRDIIRKVWEEDQ-RVRVVGWEAKTGRCIKELLAWNKERLNGSIKATISKKEKEIEILT---KDGSDWDKIRKSERELEELLEEE

Query:  ERYWKQRSREDWLNWGDRNTKWFHAKAFQRRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDSINPRVSEKQNRDLARSFSK
        E  W+QRS+  W   GDRNTK+FHA+A +RR++N I  +W+ +G W +  E +   AL YF N++ SS   P     ++++I  RV+++ N +L+++F+ 
Subjt:  ERYWKQRSREDWLNWGDRNTKWFHAKAFQRRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDSINPRVSEKQNRDLARSFSK

Query:  AEIDREIKDMKPSKAPGMDGLQALFYQNYWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKVHSPKKMEDFRPISLCNVSYKIIAKVLANKLKGILD
         E+ + +K + P+KAPG DG+ A F+ NYWD+VG    N+ L +LN    +  +N+T I+ IPK + P +M +FRPISLCN +YKII+KVLAN+ K IL 
Subjt:  AEIDREIKDMKPSKAPGMDGLQALFYQNYWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKVHSPKKMEDFRPISLCNVSYKIIAKVLANKLKGILD

Query:  SIISPSQSAFVPGRQILDNVAIGFECIHAINSRRKGQKGLMAMKLDMSKAYDRVE
        +IIS +QSAF P R I DNV + FE +H +N + +G++  M++KLDMSKA+DRVE
Subjt:  SIISPSQSAFVPGRQILDNVAIGFECIHAINSRRKGQKGLMAMKLDMSKAYDRVE

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]1.6e-7544.54Show/hide
Query:  CKKLSKFEEVWTKFSSTRDIIRKVW--EEDQRVRVVGWEAKTGRCIKELLAWNKERLN---GSIKATISKKEKEIEILTKDGSDWDKIRKSERELEELLE
        C +  KFEE W        +I++ W   +  R  +   + K   C  EL+AW     +   G+IK    + ++  E    + S  + +  S +++++LL+
Subjt:  CKKLSKFEEVWTKFSSTRDIIRKVW--EEDQRVRVVGWEAKTGRCIKELLAWNKERLN---GSIKATISKKEKEIEILTKDGSDWDKIRKSERELEELLE

Query:  EEERYWKQRSREDWLNWGDRNTKWFHAKAFQRRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDSINPRVSEKQNRDLARSF
        ++E YW QRSR +WL  GDRNTK+FHAKA QRRR+N I GI +S G WVE  EEVG+VA DYF+NLF++           LD+++ +V+E     L+  F
Subjt:  EEERYWKQRSREDWLNWGDRNTKWFHAKAFQRRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDSINPRVSEKQNRDLARSF

Query:  SKAEIDREIKDMKPSKAPGMDGLQALFYQNYWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKVHSPKKMEDFRPISLCNVSYKIIAKVLANKLKGI
        +  E+   +  M P+KAPG DG+ ALFYQ +W +VG+   +  L  LN    L  +N T I  IPKV +P++M +FRPISLCNV YKII+KVLAN+LK +
Subjt:  SKAEIDREIKDMKPSKAPGMDGLQALFYQNYWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKVHSPKKMEDFRPISLCNVSYKIIAKVLANKLKGI

Query:  LDSIISPSQSAFVPGRQILDNVAIGFECIHAINSRRKGQKGLMAMKLDMSKAYDRVE
        L  IIS +QSAFVPGR I DNV + +E +H +++R+KG+KG +A+KLD+SKAYDRVE
Subjt:  LDSIISPSQSAFVPGRQILDNVAIGFECIHAINSRRKGQKGLMAMKLDMSKAYDRVE

TrEMBL top hitse value%identityAlignment
A0A2N9F7A6 Uncharacterized protein2.8e-7844.54Show/hide
Query:  KKLSKFEEVWTKFSSTRDIIRKVWEEDQRVRVVGW----EAKTGRCIKELLAWNKERLNGSIKATISKKEKEIEILT--KDGSDWDKIRKSERELEELLE
        KK+ +FE +WTK    R +I K W ED  ++   W      K  +C   L+AW++ER  GSI A+I  K ++++  T   +     ++ + + EL  LLE
Subjt:  KKLSKFEEVWTKFSSTRDIIRKVWEEDQRVRVVGW----EAKTGRCIKELLAWNKERLNGSIKATISKKEKEIEILT--KDGSDWDKIRKSERELEELLE

Query:  EEERYWKQRSREDWLNWGDRNTKWFHAKAFQRRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDSINPRVSEKQNRDLARSF
        +EE +W+QRSR  W+N GD+NTK+FHA   QRR+ N I G++D +  W     ++  +A+ YF N+F SS P        L+ +   V+   N +L   F
Subjt:  EEERYWKQRSREDWLNWGDRNTKWFHAKAFQRRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDSINPRVSEKQNRDLARSF

Query:  SKAEIDREIKDMKPSKAPGMDGLQALFYQNYWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKVHSPKKMEDFRPISLCNVSYKIIAKVLANKLKGI
        ++ E+   ++ M P+KAPG DG+ A+FYQ YW+VVG E     L I++    L+ +N T IA +PK+ SP+K+ DFRPI+LCNV YKII+KVLAN+LK  
Subjt:  SKAEIDREIKDMKPSKAPGMDGLQALFYQNYWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKVHSPKKMEDFRPISLCNVSYKIIAKVLANKLKGI

Query:  LDSIISPSQSAFVPGRQILDNVAIGFECIHAINSRRKGQKGLMAMKLDMSKAYDRVE
        L  I+S SQSAFVPGR I DNV + FE +H+++ +R G+KG MA+KLDMSKAYDRVE
Subjt:  LDSIISPSQSAFVPGRQILDNVAIGFECIHAINSRRKGQKGLMAMKLDMSKAYDRVE

A0A2N9GNK5 Reverse transcriptase domain-containing protein3.1e-7746.44Show/hide
Query:  KFEEVWTKFSSTRDIIRKVWEEDQR-VRVVGWEAKTGRCIKELLAWNKERLNGSIKATI---SKKEKEIEILTKDGSDWDKIRKSERELEELLEEEERYW
        +FEE W         I+K WE  QR  R+     K   C K+L  W+++   GSIK  I    +K ++ E L     +   I    REL  LL +EE+ W
Subjt:  KFEEVWTKFSSTRDIIRKVWEEDQR-VRVVGWEAKTGRCIKELLAWNKERLNGSIKATI---SKKEKEIEILTKDGSDWDKIRKSERELEELLEEEERYW

Query:  KQRSREDWLNWGDRNTKWFHAKAFQRRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDSINPRVSEKQNRDLARSFSKAEID
        +QRSR  WL  GDRNTK+FH +A QR+RRN I  + D +G W+E +EE+  + +DY+  LF +S   P +    +  +   V+ + N  L R F   E++
Subjt:  KQRSREDWLNWGDRNTKWFHAKAFQRRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDSINPRVSEKQNRDLARSFSKAEID

Query:  REIKDMKPSKAPGMDGLQALFYQNYWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKVHSPKKMEDFRPISLCNVSYKIIAKVLANKLKGILDSIIS
          IK M PSKAPG DG+  +FYQ YW VVGN+  +  L  LN    L S+N T I+ IPKV +P+K+ DFRPISLCNV YK+++KVLAN+LK IL  IIS
Subjt:  REIKDMKPSKAPGMDGLQALFYQNYWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKVHSPKKMEDFRPISLCNVSYKIIAKVLANKLKGILDSIIS

Query:  PSQSAFVPGRQILDNVAIGFECIHAINSRRKGQKGLMAMKLDMSKAYDRVE
         SQSAFVPGR I DNV I FE +H ++  + G++G MA+KLDMSKAYDRVE
Subjt:  PSQSAFVPGRQILDNVAIGFECIHAINSRRKGQKGLMAMKLDMSKAYDRVE

A0A2N9GPZ7 Reverse transcriptase domain-containing protein5.2e-7744.41Show/hide
Query:  KKLSKFEEVWTKFSSTRDIIRKVW-----EEDQRVRVVGWEAKTGRCIKELLAWNKERLNGSIKATISKKEKEIEILTKDGSDW--DKIRKSERELEELL
        KKL +FE +W K    R++I   W     E      VV    K   C   L+ W++ER  GS+ ++I +K ++++ L  +        I + + +L  LL
Subjt:  KKLSKFEEVWTKFSSTRDIIRKVW-----EEDQRVRVVGWEAKTGRCIKELLAWNKERLNGSIKATISKKEKEIEILTKDGSDW--DKIRKSERELEELL

Query:  EEEERYWKQRSREDWLNWGDRNTKWFHAKAFQRRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDSINPRVSEKQNRDLARS
        E+EE +W+QRSR  W++ GD+NTK+FHA+  +RRR N I G+ D +G W     ++  +A+DYF  +F SS P  +  T +L  +   V+   N  L   
Subjt:  EEEERYWKQRSREDWLNWGDRNTKWFHAKAFQRRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDSINPRVSEKQNRDLARS

Query:  FSKAEIDREIKDMKPSKAPGMDGLQALFYQNYWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKVHSPKKMEDFRPISLCNVSYKIIAKVLANKLKG
        F+K E+   +K M P+KAPG DG+ A+FYQ YWD+VG E     L IL+    L  +N T IA IPKV +P+ + DFRPISLCNV YKI++KVLAN+LK 
Subjt:  FSKAEIDREIKDMKPSKAPGMDGLQALFYQNYWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKVHSPKKMEDFRPISLCNVSYKIIAKVLANKLKG

Query:  ILDSIISPSQSAFVPGRQILDNVAIGFECIHAINSRRKGQKGLMAMKLDMSKAYDRVE
        +L  +IS +QSAFVPGR I DNV + FE +H+++ +RKG+KG MA+KLDMSKAYDRVE
Subjt:  ILDSIISPSQSAFVPGRQILDNVAIGFECIHAINSRRKGQKGLMAMKLDMSKAYDRVE

A0A2N9GQ35 Reverse transcriptase domain-containing protein1.1e-7940.75Show/hide
Query:  NHEKEGRAERRESVMEDFLEAIDYYKLMDLGFEGC-------------------------------------KKLSKFEEVWTKFSSTRDIIRKVWEEDQ
        N E+ G   R    + DF EA+ Y +L DLGF G                                      KK+ +FE +WTK    R +I K W ED 
Subjt:  NHEKEGRAERRESVMEDFLEAIDYYKLMDLGFEGC-------------------------------------KKLSKFEEVWTKFSSTRDIIRKVWEEDQ

Query:  R--VRVVGWEAKTGRCIKELLAWNKERLNGSIKATISKKEKEIEILTKDGSDW--DKIRKSERELEELLEEEERYWKQRSREDWLNWGDRNTKWFHAKAF
        R   R+     K  +C   L+AW++ER  GS+ A+I  K ++++  T         ++ + + EL  LLE+EE +W+QRSR  W++ GD+NTK+FHA   
Subjt:  R--VRVVGWEAKTGRCIKELLAWNKERLNGSIKATISKKEKEIEILTKDGSDW--DKIRKSERELEELLEEEERYWKQRSREDWLNWGDRNTKWFHAKAF

Query:  QRRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDSINPRVSEKQNRDLARSFSKAEIDREIKDMKPSKAPGMDGLQALFYQN
        QRR+ N I G++D +  W     ++  +A+ YF N+F SS P        L+ +   V+   N +L   F++ E+   ++ M P+KAPG DG+ A+FYQ 
Subjt:  QRRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDSINPRVSEKQNRDLARSFSKAEIDREIKDMKPSKAPGMDGLQALFYQN

Query:  YWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKVHSPKKMEDFRPISLCNVSYKIIAKVLANKLKGILDSIISPSQSAFVPGRQILDNVAIGFECIH
        YW+VVG E     L I++    L+ +N T IA +PK+ SP+K+ DFRPI+LCNV YKII+KVLAN+LK IL  I+S SQSAFVPGR I DNV + FE +H
Subjt:  YWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKVHSPKKMEDFRPISLCNVSYKIIAKVLANKLKGILDSIISPSQSAFVPGRQILDNVAIGFECIH

Query:  AINSRRKGQKGLMAMKLDMSKAYDRVE
        +++ +R G+KG MA+KLDMSKAYDRVE
Subjt:  AINSRRKGQKGLMAMKLDMSKAYDRVE

A0A7N2LIH6 Uncharacterized protein1.5e-7943.91Show/hide
Query:  KKLSKFEEVWTKFSSTRDIIRKVWEEDQRVRVVGWEAKTGRCIKELLAWNKERLNGSIKATISKKEK--EIEILTKDGSDWDKIRKSERELEELLEEEER
        KK   FEE+WT+    ++I+   W+  +    +  + +  RC K L  WN+       K    KK +  ++E L       ++I+  ++E+ EL   EE 
Subjt:  KKLSKFEEVWTKFSSTRDIIRKVWEEDQRVRVVGWEAKTGRCIKELLAWNKERLNGSIKATISKKEK--EIEILTKDGSDWDKIRKSERELEELLEEEER

Query:  YWKQRSREDWLNWGDRNTKWFHAKAFQRRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDSINPRVSEKQNRDLARSFSKAE
         WKQRSR  WL +GD+N+K+FHA A QRR++N+I G+ D  G W E  E   ++ LDYF +++ S+ P   D +  L++++ RV+ + N +L + F   E
Subjt:  YWKQRSREDWLNWGDRNTKWFHAKAFQRRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDSINPRVSEKQNRDLARSFSKAE

Query:  IDREIKDMKPSKAPGMDGLQALFYQNYWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKVHSPKKMEDFRPISLCNVSYKIIAKVLANKLKGILDSI
        + + ++ M P+KAPG DG+  +FYQ YWD+VG+   N  LQ LN       +N+T I  IPK  +P+K+ +FRPISLCNV YKII+KVLAN+LK +L  +
Subjt:  IDREIKDMKPSKAPGMDGLQALFYQNYWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKVHSPKKMEDFRPISLCNVSYKIIAKVLANKLKGILDSI

Query:  ISPSQSAFVPGRQILDNVAIGFECIHAINSRRKGQKGLMAMKLDMSKAYDRVE
        I  +QSAFVPGR I DNV + FE +H+IN RRKG++GLMA+KLDMSKAYDRVE
Subjt:  ISPSQSAFVPGRQILDNVAIGFECIHAINSRRKGQKGLMAMKLDMSKAYDRVE

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein7.4e-2027.74Show/hide
Query:  WEAKTGRCIKELLAWN--KERLNGSIKATISKKEKEIEILTK---DGSDWDKIRKSERELEELLEEEERYWKQRSREDW----LNWGDRNTKWFHAKAFQ
        W+A    C  + +A N  K +   S   T++ + KE+E   +     S   +I K   EL+E +E ++   K      W    +N  DR       K   
Subjt:  WEAKTGRCIKELLAWN--KERLNGSIKATISKKEKEIEILTK---DGSDWDKIRKSERELEELLEEEERYWKQRSREDW----LNWGDRNTKWFHAKAFQ

Query:  RRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDSIN-PRVSEKQNRDLARSFSKAEIDREIKDMKPSKAPGMDGLQALFYQN
        +R +N+ID I +  G       E+     +Y+ +L+ + +   ++    LD+   PR+++++   L R  + +EI   I  +   K+PG DG  A FYQ 
Subjt:  RRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDSIN-PRVSEKQNRDLARSFSKAEIDREIKDMKPSKAPGMDGLQALFYQN

Query:  YWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKV-HSPKKMEDFRPISLCNVSYKIIAKVLANKLKGILDSIISPSQSAFVPGRQILDNVAIGFECI
        Y + +      L   I       NS     I  IPK      K E+FRPISL N+  KI+ K+LAN+++  +  +I   Q  F+PG Q   N+      I
Subjt:  YWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKV-HSPKKMEDFRPISLCNVSYKIIAKVLANKLKGILDSIISPSQSAFVPGRQILDNVAIGFECI

Query:  HAINSRRKGQKGLMAMKLDMSKAYDRVE
          IN  R   K  + + +D  KA+D+++
Subjt:  HAINSRRKGQKGLMAMKLDMSKAYDRVE

P08548 LINE-1 reverse transcriptase homolog9.7e-2027.95Show/hide
Query:  QRRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDSIN-PRVSEKQNRDLARSFSKAEIDREIKDMKPSKAPGMDGLQALFYQ
        ++R ++ I  I + N        E+ ++  +Y+  L+       ++  + L++ + PR+S+K+   L R  S +EI   I+++   K+PG DG  + FYQ
Subjt:  QRRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDSIN-PRVSEKQNRDLARSFSKAEIDREIKDMKPSKAPGMDGLQALFYQ

Query:  NYWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKV-HSPKKMEDFRPISLCNVSYKIIAKVLANKLKGILDSIISPSQSAFVPGRQILDNVAIGFEC
         + + +     NL   I       N+     I  IPK    P + E++RPISL N+  KI+ K+L N+++  +  II   Q  F+PG Q   N+      
Subjt:  NYWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKV-HSPKKMEDFRPISLCNVSYKIIAKVLANKLKGILDSIISPSQSAFVPGRQILDNVAIGFEC

Query:  IHAINSRRKGQKGLMAMKLDMSKAYDRVE
        I  IN  +   K  M + +D  KA+D ++
Subjt:  IHAINSRRKGQKGLMAMKLDMSKAYDRVE

P11369 LINE-1 retrotransposable element ORF2 protein1.3e-2130.63Show/hide
Query:  IDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDSIN-PRVSEKQNRDLARSFSKAEIDREIKDMKPSKAPGMDGLQALFYQNYWDVVG
        I+ I +  G      EE+      ++  L+ + +    +  + LD    P++++ Q   L    S  EI+  I  +   K+PG DG  A FYQ + + + 
Subjt:  IDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDSIN-PRVSEKQNRDLARSFSKAEIDREIKDMKPSKAPGMDGLQALFYQNYWDVVG

Query:  NETANLCLQILNGQESLNSLNRTVIAFIPKVH-SPKKMEDFRPISLCNVSYKIIAKVLANKLKGILDSIISPSQSAFVPGRQILDNVAIGFECIHAINSR
             L  +I       NS     I  IPK    P K+E+FRPISL N+  KI+ K+LAN+++  + +II P Q  F+PG Q   N+      IH IN  
Subjt:  NETANLCLQILNGQESLNSLNRTVIAFIPKVH-SPKKMEDFRPISLCNVSYKIIAKVLANKLKGILDSIISPSQSAFVPGRQILDNVAIGFECIHAINSR

Query:  RKGQKGLMAMKLDMSKAYDRVE
        +   K  M + LD  KA+D+++
Subjt:  RKGQKGLMAMKLDMSKAYDRVE

P14381 Transposon TX1 uncharacterized 149 kDa protein2.2e-2729.41Show/hide
Query:  KATISKKEKEIEILTKD---------GSDWDKIRKSERELEELLEEEERYWKQ----RSREDWLNWGDRNTKWFHAKAFQRRRRNKIDGIWDSNGSWVEG
        K+   ++  EIE L  +         GS+   ++    E +E L   E+   +    RSR   L   DR +++F+A   ++  R +I  ++  +G+ +E 
Subjt:  KATISKKEKEIEILTKD---------GSDWDKIRKSERELEELLEEEERYWKQ----RSREDWLNWGDRNTKWFHAKAFQRRRRNKIDGIWDSNGSWVEG

Query:  DEEVGRVALDYFNNLFKSSMPCPQDTTRILDSINPRVSEKQNRDLARSFSKAEIDREIKDMKPSKAPGMDGLQALFYQNYWDVVGNETANLCLQILNGQE
         E +   A  ++ NLF      P     + D + P VSE++   L    +  E+ + ++ M  +K+PG+DGL   F+Q +WD +G +   +  +     E
Subjt:  DEEVGRVALDYFNNLFKSSMPCPQDTTRILDSINPRVSEKQNRDLARSFSKAEIDREIKDMKPSKAPGMDGLQALFYQNYWDVVGNETANLCLQILNGQE

Query:  SLNSLNRTVIAFIPKVHSPKKMEDFRPISLCNVSYKIIAKVLANKLKGILDSIISPSQSAFVPGRQILDNVAIGFECIHAINSRRKGQKGLMAMKLDMSK
           S  R V++ +PK    + ++++RP+SL +  YKI+AK ++ +LK +L  +I P QS  VPGR I DNV +  + +H   +RR G   L  + LD  K
Subjt:  SLNSLNRTVIAFIPKVHSPKKMEDFRPISLCNVSYKIIAKVLANKLKGILDSIISPSQSAFVPGRQILDNVAIGFECIHAINSRRKGQKGLMAMKLDMSK

Query:  AYDRVE
        A+DRV+
Subjt:  AYDRVE

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM5.2e-0529.8Show/hide
Query:  EIDREIKDMKPSKAPGMDGLQALFYQNYWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKVHSPKKMEDFRPISLCNVSYKIIAKVLANKLKGILDS
        E D     +  S +PG DG+     +     +     NL L   N   S+  L RTV  FIPK  + K+ +DFRPIS+ +V  + +  +LA +L   ++ 
Subjt:  EIDREIKDMKPSKAPGMDGLQALFYQNYWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKVHSPKKMEDFRPISLCNVSYKIIAKVLANKLKGILDS

Query:  IISPSQSAFVPGRQILDNVAIGFECIHAINSRRKGQKGLMAMKLDMSKAYD
           P Q  F+P     DN  I       +    K  +      LD+SKA+D
Subjt:  IISPSQSAFVPGRQILDNVAIGFECIHAINSRRKGQKGLMAMKLDMSKAYD

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.0e-1631.22Show/hide
Query:  ERYWKQRSREDWLNWGDRNTKWFHAKAFQRRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLF--KSSMPCPQDTTRILDSINPRVSEKQNRDLARSF
        E +++Q+SR  WL  GD NT++FH      + +N I  +   +   VE   +V  + + Y+ +L    S +  P    RI D    R ++     L+   
Subjt:  ERYWKQRSREDWLNWGDRNTKWFHAKAFQRRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLF--KSSMPCPQDTTRILDSINPRVSEKQNRDLARSF

Query:  SKAEIDREIKDMKPSKAPGMDGLQALFYQNYWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKVHSPKKMEDFRPISLCNVSYKII
        S  EI   +  M  +KAPG D   A F+   W VV + T     +       L   N T I  IPKV    ++  FRP+S C V YKII
Subjt:  SKAEIDREIKDMKPSKAPGMDGLQALFYQNYWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKVHSPKKMEDFRPISLCNVSYKII

AT4G20520.1 RNA binding;RNA-directed DNA polymerases4.2e-1042.19Show/hide
Query:  LANKLKGILDSIISPSQSAFVPGRQILDNVAIGFECIHAINSRRKGQKGLMAMKLDMSKAYDRV
        +  +LK ++ ++I P+Q++F+PGR   DN+    E +H++  R+KG KG M +KLD+ KAYDR+
Subjt:  LANKLKGILDSIISPSQSAFVPGRQILDNVAIGFECIHAINSRRKGQKGLMAMKLDMSKAYDRV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACAACCATGAGAAGGAAGGTAGAGCTGAGAGGAGGGAATCTGTTATGGAAGACTTTCTTGAAGCTATAGATTACTACAAGTTAATGGACCTGGGGTTTGAGGGCTG
TAAAAAATTATCAAAATTTGAGGAGGTTTGGACAAAGTTTTCCTCCACCAGAGACATCATTAGAAAGGTTTGGGAAGAAGACCAAAGGGTGAGGGTTGTAGGCTGGGAAG
CTAAAACTGGGAGGTGTATAAAAGAACTCCTGGCGTGGAACAAAGAAAGACTCAATGGTAGTATCAAAGCAACCATATCTAAAAAAGAGAAGGAAATTGAGATTCTAACA
AAAGATGGGAGCGACTGGGACAAGATTAGGAAGTCTGAGAGGGAGCTTGAAGAGTTGCTGGAGGAAGAAGAGCGCTATTGGAAACAACGTTCAAGGGAGGACTGGCTGAA
TTGGGGAGACAGAAATACAAAGTGGTTCCATGCTAAGGCGTTTCAAAGAAGAAGGAGGAACAAAATTGATGGTATTTGGGATTCGAATGGCTCCTGGGTTGAGGGGGATG
AGGAAGTTGGAAGAGTAGCTCTAGACTACTTTAACAACCTGTTTAAATCCTCTATGCCTTGCCCTCAAGATACCACTAGAATCCTAGACAGCATAAACCCAAGAGTTTCA
GAGAAGCAGAACCGTGACCTTGCAAGGTCGTTTTCAAAAGCGGAAATTGACAGAGAGATTAAAGATATGAAACCTTCTAAAGCGCCTGGTATGGATGGCCTTCAAGCCCT
TTTCTACCAAAATTACTGGGACGTGGTTGGCAATGAGACTGCAAATCTTTGCCTTCAAATCCTCAACGGGCAAGAAAGTTTAAATAGCTTAAATAGAACAGTCATAGCGT
TTATTCCCAAGGTTCACAGTCCTAAGAAGATGGAGGATTTCAGACCAATAAGCCTTTGCAACGTCAGTTATAAAATTATTGCAAAGGTTCTAGCCAACAAACTTAAAGGT
ATCCTAGACTCGATTATTTCTCCCTCCCAATCAGCTTTTGTCCCAGGGAGACAAATTTTAGACAACGTGGCTATTGGTTTTGAATGTATTCACGCAATCAACTCCAGAAG
AAAGGGTCAGAAGGGTCTCATGGCTATGAAACTCGACATGAGCAAAGCTTATGACCGAGTCGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGTACAACCATGAGAAGGAAGGTAGAGCTGAGAGGAGGGAATCTGTTATGGAAGACTTTCTTGAAGCTATAGATTACTACAAGTTAATGGACCTGGGGTTTGAGGGCTG
TAAAAAATTATCAAAATTTGAGGAGGTTTGGACAAAGTTTTCCTCCACCAGAGACATCATTAGAAAGGTTTGGGAAGAAGACCAAAGGGTGAGGGTTGTAGGCTGGGAAG
CTAAAACTGGGAGGTGTATAAAAGAACTCCTGGCGTGGAACAAAGAAAGACTCAATGGTAGTATCAAAGCAACCATATCTAAAAAAGAGAAGGAAATTGAGATTCTAACA
AAAGATGGGAGCGACTGGGACAAGATTAGGAAGTCTGAGAGGGAGCTTGAAGAGTTGCTGGAGGAAGAAGAGCGCTATTGGAAACAACGTTCAAGGGAGGACTGGCTGAA
TTGGGGAGACAGAAATACAAAGTGGTTCCATGCTAAGGCGTTTCAAAGAAGAAGGAGGAACAAAATTGATGGTATTTGGGATTCGAATGGCTCCTGGGTTGAGGGGGATG
AGGAAGTTGGAAGAGTAGCTCTAGACTACTTTAACAACCTGTTTAAATCCTCTATGCCTTGCCCTCAAGATACCACTAGAATCCTAGACAGCATAAACCCAAGAGTTTCA
GAGAAGCAGAACCGTGACCTTGCAAGGTCGTTTTCAAAAGCGGAAATTGACAGAGAGATTAAAGATATGAAACCTTCTAAAGCGCCTGGTATGGATGGCCTTCAAGCCCT
TTTCTACCAAAATTACTGGGACGTGGTTGGCAATGAGACTGCAAATCTTTGCCTTCAAATCCTCAACGGGCAAGAAAGTTTAAATAGCTTAAATAGAACAGTCATAGCGT
TTATTCCCAAGGTTCACAGTCCTAAGAAGATGGAGGATTTCAGACCAATAAGCCTTTGCAACGTCAGTTATAAAATTATTGCAAAGGTTCTAGCCAACAAACTTAAAGGT
ATCCTAGACTCGATTATTTCTCCCTCCCAATCAGCTTTTGTCCCAGGGAGACAAATTTTAGACAACGTGGCTATTGGTTTTGAATGTATTCACGCAATCAACTCCAGAAG
AAAGGGTCAGAAGGGTCTCATGGCTATGAAACTCGACATGAGCAAAGCTTATGACCGAGTCGAATGA
Protein sequenceShow/hide protein sequence
MYNHEKEGRAERRESVMEDFLEAIDYYKLMDLGFEGCKKLSKFEEVWTKFSSTRDIIRKVWEEDQRVRVVGWEAKTGRCIKELLAWNKERLNGSIKATISKKEKEIEILT
KDGSDWDKIRKSERELEELLEEEERYWKQRSREDWLNWGDRNTKWFHAKAFQRRRRNKIDGIWDSNGSWVEGDEEVGRVALDYFNNLFKSSMPCPQDTTRILDSINPRVS
EKQNRDLARSFSKAEIDREIKDMKPSKAPGMDGLQALFYQNYWDVVGNETANLCLQILNGQESLNSLNRTVIAFIPKVHSPKKMEDFRPISLCNVSYKIIAKVLANKLKG
ILDSIISPSQSAFVPGRQILDNVAIGFECIHAINSRRKGQKGLMAMKLDMSKAYDRVE