; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038682 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038682
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr2:23221067..23223899
RNA-Seq ExpressionLag0038682
SyntenyLag0038682
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]5.0e-8137.99Show/hide
Query:  VGDVTTLNFLDILNCKCSVSGWNYTYLALIPKTANPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKN
        VG  T    L+ LN    +  WN TY+ALIPK   P+ ++DFRPISLCNV+YKII+K + NR+K V+  ++S+ QSAFVP R+I DNVI+ HECLHT+ +
Subjt:  VGDVTTLNFLDILNCKCSVSGWNYTYLALIPKTANPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKN

Query:  KRRGRKGFMALKFDMSKTYDRVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPFPHICLYYVQRCYQHFSLKHWLIN
         + G  G  ALK D+SK +DRVEWT+LE ++ KMGF+  W+  I+ C+ +  FSI  NG P G   P  G+RQG+P   +  Y    C +  S      N
Subjt:  KRRGRKGFMALKFDMSKTYDRVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPFPHICLYYVQRCYQHFSLKHWLIN

Query:  RSQELDQV---KTALPFPIYFFADDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPNVSSDCQGYLSEIMSISVVSNLGRYLDIPSTFTR
         S  L  +   +         FADDSLIF ++   EC  +R+LL  Y   S Q IN+ KSA++ SPNV  + Q YL  I+++ +VS+ G YL +PS FTR
Subjt:  RSQELDQV---KTALPFPIYFFADDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPNVSSDCQGYLSEIMSISVVSNLGRYLDIPSTFTR

Query:  RRGE---------------------DFR--------VIKQRVWQTLQG------------------------------------W------KGLRKRVGN
        RRGE                     +FR        ++ + VW+ LQ                                     W      KGLR RVGN
Subjt:  RRGE---------------------DFR--------VIKQRVWQTLQG------------------------------------W------KGLRKRVGN

Query:  GLAIDFFKDPWIPKETTFKPL------------------------------CRPNEDVDIIISSIPINNCSEEDKWIWHYTSHGEYTVKSGYKLSMTDSV
        G  I  F DPW+P+ TTFKPL                              C  NED D+I+ S+PI++ + +D W+WHY   G Y+V+SGYKL M    
Subjt:  GLAIDFFKDPWIPKETTFKPL------------------------------CRPNEDVDIIISSIPINNCSEEDKWIWHYTSHGEYTVKSGYKLSMTDSV

Query:  GQGSSSLN
           S+S N
Subjt:  GQGSSSLN

XP_023894138.1 uncharacterized protein LOC112006071 [Quercus suber]7.5e-6930.17Show/hide
Query:  IMTSEGSWTTDQKEMERSFEEYFQTIFSTSNPNAEAINMVEVGDVTTLNFLDI-----------------------------------------------
        I   E    TDQ+++  +F E+++ +F++S P   + ++  +  V T    +I                                               
Subjt:  IMTSEGSWTTDQKEMERSFEEYFQTIFSTSNPNAEAINMVEVGDVTTLNFLDI-----------------------------------------------

Query:  ----LNCKCSVSGWNYTYLALIPKTANPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKNKRRGRKGF
            LN     +  N+T++ LIPKT NP+ V++FRPISLC+V YKI +KVL NR+K VL  I+SE+QSAF+ GR I DN++V++E LH +KN   G+ GF
Subjt:  ----LNCKCSVSGWNYTYLALIPKTANPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKNKRRGRKGF

Query:  MALKFDMSKTYDRVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPF-PHICLYYVQRCYQHFSLKHWLINRSQELDQ
        MALK DMSK YDRV+W+FL+ ++L+MGFD  WV ++M+C+ +  +SIL NG P+G+I P  G+RQG+P  P++ L     C +     H +I R+     
Subjt:  MALKFDMSKTYDRVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPF-PHICLYYVQRCYQHFSLKHWLINRSQELDQ

Query:  VK------TALPFPIYFFADDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPNVSSDCQGYLSEIMSISVVSNLGRYLDIPSTFTRRRGE
        +K              FFADDSL+F +AT++EC  + ++L  YE  S Q +N  K+A+  S + + + Q  + + + +S +     YL +P+   R +  
Subjt:  VK------TALPFPIYFFADDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPNVSSDCQGYLSEIMSISVVSNLGRYLDIPSTFTRRRGE

Query:  DFRVIKQRVWQTLQGW-------------------------------------------------------------------------KGLRKRVGNGL
         F  IKQRVW+ LQGW                                                                         KG   RVG+G 
Subjt:  DFRVIKQRVWQTLQGW-------------------------------------------------------------------------KGLRKRVGNGL

Query:  AIDFFKDPWIPKETTFK---PLCRPNEDVDI--------------------------IISSIPINNCSEEDKWIWHYTSHGEYTVKSGYK
         I  + D W+P +   K   P  R   +  +                          II +IP+++ S+ DK IW +T  G+YTVK GY+
Subjt:  AIDFFKDPWIPKETTFK---PLCRPNEDVDI--------------------------IISSIPINNCSEEDKWIWHYTSHGEYTVKSGYK

XP_023908235.1 uncharacterized protein LOC112019924 [Quercus suber]1.3e-6833.97Show/hide
Query:  VGDVTTLNFLDILNCKCSVSGWNYTYLALIPKTANPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKN
        VGD      LD LN        N+TY+ LIPK  NP+ ++DFRPISLCNV YKII+KVLVNR+K VL +I+S  QSAFVPGR I DNV++++E LHT+ +
Subjt:  VGDVTTLNFLDILNCKCSVSGWNYTYLALIPKTANPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKN

Query:  KRRGRKGFMALKFDMSKTYDRVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPFPHICLYYVQRCYQHFSLKHWLIN
        +++G+KG+MALK D+SK YDRVEW FL+ ++ ++GF   W+  +M CV +T FS+L NG P GN+ P  G+RQG+P   +  Y    C + F+    L++
Subjt:  KRRGRKGFMALKFDMSKTYDRVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPFPHICLYYVQRCYQHFSLKHWLIN

Query:  RS---QELDQV---KTALPFPIYFFADDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPNVSSDCQGYLSEIMSISVVSNLGRYLDIPST
        R+   + L  V   + A       FADDSLIF++ +  E   I ++L+ Y   S QSIN  KS++  S N +   +     I+ +  VS    YL +P+ 
Subjt:  RS---QELDQV---KTALPFPIYFFADDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPNVSSDCQGYLSEIMSISVVSNLGRYLDIPST

Query:  FTRRRGEDFRVIKQRVWQTLQGWKGLRK------------------------------------------------------------------------
          R +   F  IK RVW+ LQGWKG+                                                                          
Subjt:  FTRRRGEDFRVIKQRVWQTLQGWKGLRK------------------------------------------------------------------------

Query:  -RVGNGLAIDFFKDPWIPKETTFKPLCRPNEDVDII-----------------------------ISSIPINNCSEEDKWIWHYTSHGEYTVKSGYK---
         RVG+G +I+ ++D WIP+  T KPL    ED + +                             I  IP++     D   W +   G +TVKS YK   
Subjt:  -RVGNGLAIDFFKDPWIPKETTFKPLCRPNEDVDII-----------------------------ISSIPINNCSEEDKWIWHYTSHGEYTVKSGYK---

Query:  -LSMTDSVGQGSSSLNSLRKW
         LS  +   + SS     R W
Subjt:  -LSMTDSVGQGSSSLNSLRKW

XP_030930869.1 uncharacterized protein LOC115956706 [Quercus lobata]1.7e-6836.98Show/hide
Query:  GDVTTLNFLDILNCKCSVSGWNYTYLALIPKTANPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKNK
        GDV   + L  LN     +  N+T++ LIPK  NP+ VT +RPISLCNV YKI +KVL NR+K VL +I SE+QSAF+ GR I DN++V+ E LH +KN 
Subjt:  GDVTTLNFLDILNCKCSVSGWNYTYLALIPKTANPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKNK

Query:  RRGRKGFMALKFDMSKTYDRVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPFPHICLYYVQRCYQHFSLKHWLINR
          G  G+  LK DMSK YDR+EW+FL+ ++ +MGF+  W++++M+C+ S  +SIL NG P G+I P  G+RQG+P   + LY    C +     H LI +
Subjt:  RRGRKGFMALKFDMSKTYDRVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPFPHICLYYVQRCYQHFSLKHWLINR

Query:  SQELDQV------KTALPFPIYFFADDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPNVSSDCQGYLSEIMSISVVSNLGRYLDIPSTF
        + E   +      +        FFADDSL+F +AT +EC  +  +L  YE  S + +N  K+A+  S +  ++ Q  + E + ++ +     YL +P+  
Subjt:  SQELDQV------KTALPFPIYFFADDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPNVSSDCQGYLSEIMSISVVSNLGRYLDIPSTF

Query:  TRRRGEDFRVIKQRVWQTLQGWKG-LRKRVGNGLAIDFFKDPWIPKETTFKPLCRP----------NEDVDIIISSIPINNCSEEDKWIWHYTSHGEYTV
         R +   F  +KQR+W+ LQGW+G L  + G        ++     E    P+ +           NE     I +IP+ + ++ D  +W +T  G Y+V
Subjt:  TRRRGEDFRVIKQRVWQTLQGWKG-LRKRVGNGLAIDFFKDPWIPKETTFKPLCRP----------NEDVDIIISSIPINNCSEEDKWIWHYTSHGEYTV

Query:  KSGYKLSMTDS
        KSGY+    +S
Subjt:  KSGYKLSMTDS

XP_042962672.1 uncharacterized protein LOC122296942 [Carya illinoinensis]1.2e-6947.22Show/hide
Query:  VGDVTTLNFLDILNCKCSVSGWNYTYLALIPKTANPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKN
        +G+  T   L  LN     SG N+T++ LIPK A+P  V DFRPISLCNV YKI++KV+ NR+K VL DI+S +QSAFVPGR I DNV++++E LH L+N
Subjt:  VGDVTTLNFLDILNCKCSVSGWNYTYLALIPKTANPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKN

Query:  KRRGRKGFMALKFDMSKTYDRVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPF-PHICLYYVQRCYQHFSLKHWLI
        KR+GRKGFM+LK DMSK YDRV+W FLE+++  +GFD + +++IM CVR+  FS+L NG P G I+P  GLRQG+P  P++ L   +       LK    
Subjt:  KRRGRKGFMALKFDMSKTYDRVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPF-PHICLYYVQRCYQHFSLKHWLI

Query:  NRSQE-LDQVKTALPFP---IYFFADDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPNVSSDCQGYLSEIMSISVVSNLGRYLDIPSTF
        N S+E +D ++     P      FADDS+IF KA V     I+ LL  YE  S Q IN  K++MV S NV+ D +  + ++   S      +YL  P   
Subjt:  NRSQE-LDQVKTALPFP---IYFFADDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPNVSSDCQGYLSEIMSISVVSNLGRYLDIPSTF

Query:  TRRRGEDFRVIKQRVWQTLQGWKG
         R + + F  IK+RVWQ LQ WKG
Subjt:  TRRRGEDFRVIKQRVWQTLQGWKG

TrEMBL top hitse value%identityAlignment
A0A2N9FJ03 Reverse transcriptase domain-containing protein6.6e-7134.8Show/hide
Query:  LDILNCKCSVSGWNYTYLALIPKTANPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKNKRRGRKGFM
        L  LN  C +   N+T++ LIPK  NP+ VTDFRPISLCNV YK+++KVL NR+K +L  ++SE+QSAFVPGR I DNV+V+ E LH + + + GR+G M
Subjt:  LDILNCKCSVSGWNYTYLALIPKTANPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKNKRRGRKGFM

Query:  ALKFDMSKTYDRVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPF-PHICLYYVQRCYQHFSLKHWLINRSQELDQV
        ALK DMSK YDRVEW +L +++ KMGF  +W+ ++ +C+ +  +SIL NG P GNI+P  GLRQG+P  P++ L   +         H LI +++    +
Subjt:  ALKFDMSKTYDRVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPF-PHICLYYVQRCYQHFSLKHWLINRSQELDQV

Query:  ------KTALPFPIYFFADDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPNVSSDCQGYLSEIMSISVVSNLGRYLDIPSTFTRRRGED
              +        FFADDSL+F KAT+  C  I+ +LR YE  S Q +N  K+ +  S  V +  Q  + + + +S++    +YL +PS   R R E 
Subjt:  ------KTALPFPIYFFADDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPNVSSDCQGYLSEIMSISVVSNLGRYLDIPSTFTRRRGED

Query:  FRVIKQRVWQTLQ-------------------------------GWKGLRK-----------RVGNGLAIDFFKDPWIPKE------TTFKPLCRPNEDV
        F  IK+RVW+ +                                 W+ + K           RVGNG  I  ++  W+ ++      T   P+ R +   
Subjt:  FRVIKQRVWQTLQ-------------------------------GWKGLRK-----------RVGNGLAIDFFKDPWIPKE------TTFKPLCRPNEDV

Query:  DII-----------------------ISSIPINNCSEEDKWIWHYTSHGEYTVKSGYK--LSMTDSVGQGSSSLNSL
         +I                       I +IP+++ +  DK+ W  T++G Y+VKSGY+  + + +    GSS+ N L
Subjt:  DII-----------------------ISSIPINNCSEEDKWIWHYTSHGEYTVKSGYK--LSMTDSVGQGSSSLNSL

A0A2N9G5I8 Reverse transcriptase domain-containing protein1.1e-7039.47Show/hide
Query:  IMTSEGSWTTDQKEMERSFEEYFQTIFSTSNP-----NAEAINMV----------------------------------------------EVGDVTTLN
        I  ++G   T  + + R FEEYF ++F TSNP       E I+ V                                               VG+  +  
Subjt:  IMTSEGSWTTDQKEMERSFEEYFQTIFSTSNP-----NAEAINMV----------------------------------------------EVGDVTTLN

Query:  FLDILNCKCSVSGWNYTYLALIPKTANPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKNKRRGRKGF
         L  LN    +   N+TY+ LIPK  NP  VTDFRPISLCNV YKI++KVL NR+K +L  I+SE QSAFVPGR I DN++V+ E LH +K +  G+ GF
Subjt:  FLDILNCKCSVSGWNYTYLALIPKTANPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKNKRRGRKGF

Query:  MALKFDMSKTYDRVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPF-PHICLYYVQRCYQHFSLKHWLINRS--QEL
        MALK DMSK YDRVEW FL+ ++LKMGF+ +WV+++M+C+ S  +SIL NG P G + P  GLRQG+P  P++ L   +  +    L H   N    + +
Subjt:  MALKFDMSKTYDRVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPF-PHICLYYVQRCYQHFSLKHWLINRS--QEL

Query:  DQVKTALPFPIYFFADDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPNVSSDCQGYLSEIMSISVVSNLGRYLDIPSTFTRRRGEDFRV
           +        FFADDSL+F +A  EEC  +  +L  YEA S Q IN  K+ +  S +     Q  + +I+ + VV    +YL +PS   R R E F  
Subjt:  DQVKTALPFPIYFFADDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPNVSSDCQGYLSEIMSISVVSNLGRYLDIPSTFTRRRGEDFRV

Query:  IKQRVWQTLQGWK
        IK+++WQ LQGWK
Subjt:  IKQRVWQTLQGWK

A0A5B7AER2 Reverse transcriptase domain-containing protein6.0e-7246.23Show/hide
Query:  GDVTTLNFLDILNCKC-SVSGWNYTYLALIPKTANPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKN
        GD+T +  LD LN    S+ G N TY+ LIPK  NP+ +++FRPISLCNV YKII+K+L NR+K +L DI+ E+QSAFVPGR I  N++V+ E +H LKN
Subjt:  GDVTTLNFLDILNCKC-SVSGWNYTYLALIPKTANPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKN

Query:  KRRGRKGFMALKFDMSKTYDRVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPFPHICLYYVQRCYQHFSLKHWLIN
        +R+G+ G +ALK DMSK YDRVEW FLE ++LKMGF  +WV+++M CV +  FS+L NG P G I P  GLRQG+P            +     K    N
Subjt:  KRRGRKGFMALKFDMSKTYDRVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPFPHICLYYVQRCYQHFSLKHWLIN

Query:  RSQELDQVKTALPFPIYFFADDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPNVSSDCQGYLSEIMSISVVSNLGRYLDIPSTFTRRRG
        +   +   + A      FFADDSL+FAKA  ++   I +++  Y   S Q IN+ KS++  SPNV  D +  + +I+ +S+ S   +YL +PST  R + 
Subjt:  RSQELDQVKTALPFPIYFFADDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPNVSSDCQGYLSEIMSISVVSNLGRYLDIPSTFTRRRG

Query:  EDFRVIKQRVWQTLQGWK
        + F  IK RVW+ LQGWK
Subjt:  EDFRVIKQRVWQTLQGWK

A0A5B7BN08 Reverse transcriptase domain-containing protein4.1e-7345.91Show/hide
Query:  GDVTTLNFLDILNCKC-SVSGWNYTYLALIPKTANPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKN
        GD+T +  LD LN    S+   NYTY+ALIPK  +P+ +++FRPISLCNV YKII+K+L NR+K +L +I++E+QSAFVPGR I DN++V+ E +H LKN
Subjt:  GDVTTLNFLDILNCKC-SVSGWNYTYLALIPKTANPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKN

Query:  KRRGRKGFMALKFDMSKTYDRVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPFPHICLYYVQRCYQHFSLKHWLIN
        KR+G+ G  ALK DMSK YDRVEW+FLE ++L+MGF  +WV++IM CV +  FS+L NG P G I P  GLRQG+P            +     K    N
Subjt:  KRRGRKGFMALKFDMSKTYDRVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPFPHICLYYVQRCYQHFSLKHWLIN

Query:  RSQELDQVKTALPFPIYFFADDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPNVSSDCQGYLSEIMSISVVSNLGRYLDIPSTFTRRRG
        +   +   + A      FFADDSL+FA AT  +   I +++  Y A S Q +N+ KSA+  S NV++D +  + +I+ +S+ S   +YL +PST  R + 
Subjt:  RSQELDQVKTALPFPIYFFADDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPNVSSDCQGYLSEIMSISVVSNLGRYLDIPSTFTRRRG

Query:  EDFRVIKQRVWQTLQGWK
        + F +I+ RVW+ L+GWK
Subjt:  EDFRVIKQRVWQTLQGWK

A0A6J1DX30 uncharacterized protein LOC1110248742.4e-8137.99Show/hide
Query:  VGDVTTLNFLDILNCKCSVSGWNYTYLALIPKTANPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKN
        VG  T    L+ LN    +  WN TY+ALIPK   P+ ++DFRPISLCNV+YKII+K + NR+K V+  ++S+ QSAFVP R+I DNVI+ HECLHT+ +
Subjt:  VGDVTTLNFLDILNCKCSVSGWNYTYLALIPKTANPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKN

Query:  KRRGRKGFMALKFDMSKTYDRVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPFPHICLYYVQRCYQHFSLKHWLIN
         + G  G  ALK D+SK +DRVEWT+LE ++ KMGF+  W+  I+ C+ +  FSI  NG P G   P  G+RQG+P   +  Y    C +  S      N
Subjt:  KRRGRKGFMALKFDMSKTYDRVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPFPHICLYYVQRCYQHFSLKHWLIN

Query:  RSQELDQV---KTALPFPIYFFADDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPNVSSDCQGYLSEIMSISVVSNLGRYLDIPSTFTR
         S  L  +   +         FADDSLIF ++   EC  +R+LL  Y   S Q IN+ KSA++ SPNV  + Q YL  I+++ +VS+ G YL +PS FTR
Subjt:  RSQELDQV---KTALPFPIYFFADDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPNVSSDCQGYLSEIMSISVVSNLGRYLDIPSTFTR

Query:  RRGE---------------------DFR--------VIKQRVWQTLQG------------------------------------W------KGLRKRVGN
        RRGE                     +FR        ++ + VW+ LQ                                     W      KGLR RVGN
Subjt:  RRGE---------------------DFR--------VIKQRVWQTLQG------------------------------------W------KGLRKRVGN

Query:  GLAIDFFKDPWIPKETTFKPL------------------------------CRPNEDVDIIISSIPINNCSEEDKWIWHYTSHGEYTVKSGYKLSMTDSV
        G  I  F DPW+P+ TTFKPL                              C  NED D+I+ S+PI++ + +D W+WHY   G Y+V+SGYKL M    
Subjt:  GLAIDFFKDPWIPKETTFKPL------------------------------CRPNEDVDIIISSIPINNCSEEDKWIWHYTSHGEYTVKSGYKLSMTDSV

Query:  GQGSSSLN
           S+S N
Subjt:  GQGSSSLN

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein4.5e-1622.52Show/hide
Query:  LALIPKTANPKVVTD-FRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKNKRRGR-KGFMALKFDMSKTYDRVEW
        + LIPK        + FRPISL N++ KI+ K+L NR++  +K ++  +Q  F+PG     N+    + ++ +++  R + K  + +  D  K +D+++ 
Subjt:  LALIPKTANPKVVTD-FRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKNKRRGR-KGFMALKFDMSKTYDRVEW

Query:  TFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPFPHICLYYVQRCYQHFSLKHWLINRSQELDQVKTAL-PFPIYFFADDS
         F+ + + K+G D  ++ II         +I+ NG  +     + G RQG P   +    V             I + +E+  ++       +  FADD 
Subjt:  TFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPFPHICLYYVQRCYQHFSLKHWLINRSQELDQVKTAL-PFPIYFFADDS

Query:  LIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPN---VSSDCQGYLSEIMSISVVSNLGRYL--DIPSTFTRRRGEDFRVIKQRVWQTLQGWK
        +++ +  +     + KL+ ++   S   IN +KS   +  N     S   G L   ++   +  LG  L  D+   F     E+++ + + + +    WK
Subjt:  LIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPN---VSSDCQGYLSEIMSISVVSNLGRYL--DIPSTFTRRRGEDFRVIKQRVWQTLQGWK

Query:  GL
         +
Subjt:  GL

P08548 LINE-1 reverse transcriptase homolog2.1e-1823.43Show/hide
Query:  LALIPKTA-NPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNV-----IVSHECLHTLKNKRRGRKGFMALKFDMSKTYD
        + LIPK   +P    ++RPISL N++ KI+ K+L NR++  +K I+  +Q  F+PG     N+     ++ H  ++ LKN     K  M L  D  K +D
Subjt:  LALIPKTA-NPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNV-----IVSHECLHTLKNKRRGRKGFMALKFDMSKTYD

Query:  RVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPFPHICLYYVQRCYQHFSLKHWLINRSQ-ELDQVKTALPFPIYFF
         ++  F+ + + K+G +  ++ +I         +I+ NGV + +   + G RQG P   +    V         +   I       +++K +L      F
Subjt:  RVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPFPHICLYYVQRCYQHFSLKHWLINRSQ-ELDQVKTALPFPIYFF

Query:  ADDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPNVSSDCQGYLSEIMSISVVSNLGRYLDIPSTFTRR--RGEDFRVIKQRVWQTLQGW
        ADD +++ + T +    + +++++Y   S   IN  KS   +  N ++  +  + + +  +VV    +YL +  T   +    E++  +++ + + +  W
Subjt:  ADDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPNVSSDCQGYLSEIMSISVVSNLGRYLDIPSTFTRR--RGEDFRVIKQRVWQTLQGW

Query:  KGL
        K +
Subjt:  KGL

P11369 LINE-1 retrotransposable element ORF2 protein1.4e-1726.07Show/hide
Query:  LALIPK-TANPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKNKRRGRKGFMALKFDMSKTYDRVEWT
        + LIPK   +P  + +FRPISL N++ KI+ K+L NR++  +K I+  +Q  F+PG     N+  S   +H + NK +  K  M +  D  K +D+++  
Subjt:  LALIPK-TANPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKNKRRGRKGFMALKFDMSKTYDRVEWT

Query:  FLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPFP----HICLYYVQRCY-QHFSLKHWLINRSQELDQVKTALPFPIYFFA
        F+ +++ + G    ++N+I         +I  NG  +  I  + G RQG P      +I L  + R   Q   +K   I +    ++VK +L       A
Subjt:  FLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPFP----HICLYYVQRCY-QHFSLKHWLINRSQELDQVKTALPFPIYFFA

Query:  DDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKS-AMVVSPNVSSDCQGYLSEIMSISVVSNLGRYLDIPSTFTRR--RGEDFRVIKQRVWQTLQGW
        DD +++          +  L+  +       IN  KS A + + N  ++ +  + E    S+V+N  +YL +  T   +    ++F+ +K+ + + L+ W
Subjt:  DDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKS-AMVVSPNVSSDCQGYLSEIMSISVVSNLGRYLDIPSTFTRR--RGEDFRVIKQRVWQTLQGW

Query:  KGL
        K L
Subjt:  KGL

P14381 Transposon TX1 uncharacterized 149 kDa protein1.2e-1624.37Show/hide
Query:  EYFQTIFSTSNPNAEAI--NMVEVGDVTTLNFLDILNCKCSVSGWNYTYLALIPKTANPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSA
        E+FQ  + T  P+   +     + G++        L+C+ +V       L+L+PK  + +++ ++RP+SL + +YKI+AK +  R+K VL +++  +QS 
Subjt:  EYFQTIFSTSNPNAEAI--NMVEVGDVTTLNFLDILNCKCSVSGWNYTYLALIPKTANPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSA

Query:  FVPGRSIHDNVIVSHECLHTLKNKRRGRKGFMALKFDMSKTYDRVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPF
         VPGR+I DNV +  + LH     RR       L  D  K +DRV+  +L   +    F  ++V  +     S    +  N      +    G+RQG P 
Subjt:  FVPGRSIHDNVIVSHECLHTLKNKRRGRKGFMALKFDMSKTYDRVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPF

Query:  ----------PHICLYYVQRCYQHFSLKHWLINRSQELDQVKTALPFPIYFFADDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPNVSS
                  P +CL   +           L+ +  ++  V +A    +   A D +   +A  +EC  +      Y A S   IN+ KS+ ++  ++  
Subjt:  ----------PHICLYYVQRCYQHFSLKHWLINRSQELDQVKTALPFPIYFFADDSLIFAKATVEECWMIRKLLRDYEATSDQSINYRKSAMVVSPNVSS

Query:  DC--QGYLSEIMSISVVSNLGRYLDIPSTFTRRRGEDFRVIKQRVWQTLQGWKGLRK
        D     +        ++  LG YL   S       ++F  +++ V   L  WKG  K
Subjt:  DC--QGYLSEIMSISVVSNLGRYLDIPSTFTRRRGEDFRVIKQRVWQTLQGWKGLRK

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases5.6e-1436.17Show/hide
Query:  LVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKNKRRGRKGFMALKFDMSKTYDRVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTF
        +V R+K ++ +++   Q++F+PGR   DN++   E +H+++ K +G KG+M LK D+ K YDR+ W +LE  ++  GF   W   + +  RSTF
Subjt:  LVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKNKRRGRKGFMALKFDMSKTYDRVEWTFLEQLVLKMGFDCRWVNIIMDCVRSTF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGACCGATCACACCCCAAGATGATTGGTGGGGACTTCAAACCTTTAACCTCGAAAGAGACCCTTAGGAAAGGATGTCGGCCGCTCCGCGTCGCGATGACG
TTGCAACAGCCTATAATGACATCCGAGGGGTCATGGACTACTGACCAAAAAGAAATGGAAAGATCTTTTGAAGAATATTTTCAAACTATCTTTTCGACATCTAAC
CCAAATGCAGAGGCTATTAATATGGTTGAGGTTGGTGATGTTACGACTCTTAATTTTTTAGATATTTTAAACTGTAAGTGCTCTGTTAGTGGGTGGAATTATACA
TATTTAGCTTTGATCCCCAAGACGGCTAATCCAAAGGTAGTGACAGATTTTCGACCTATAAGCTTATGCAATGTCAACTACAAAATCATTGCTAAGGTTTTGGTG
AACAGAATGAAAGGCGTCCTTAAGGATATTGTCTCGGAGAATCAATCTGCTTTTGTCCCAGGTCGCTCAATTCATGACAATGTTATTGTAAGCCATGAATGTTTA
CATACTCTGAAAAATAAAAGGCGTGGAAGAAAGGGGTTTATGGCCTTAAAATTTGATATGAGCAAGACCTATGATCGAGTGGAATGGACTTTCTTGGAACAGTTG
GTGCTGAAAATGGGATTTGATTGTCGTTGGGTCAATATTATAATGGACTGCGTGCGTTCAACTTTTTTCTCAATCTTGTTTAACGGCGTACCTATAGGAAATATT
GTTCCTCAGTGTGGACTGAGGCAGGGGAACCCCTTTCCCCATATTTGTTTATATTATGTGCAGAGGTGTTATCAGCACTTTTCTCTGAAGCATTGGCTAATCAAC
AGATCTCAGGAATTAGACCAGGTAAAAACTGCCCTACCATTTCCCATTTATTTTTTCGCTGATGATAGTTTGATTTTTGCAAAGGCAACGGTAGAGGAATGTTGG
ATGATTCGAAAATTACTTCGTGATTATGAAGCAACGTCAGACCAATCTATAAATTATAGGAAATCGGCGATGGTTGTTTCTCCAAATGTTTCATCTGATTGTCAA
GGATATCTTTCTGAGATCATGTCTATTTCAGTTGTCTCCAATTTGGGGAGATATCTCGACATTCCTTCTACATTTACTAGACGTAGAGGAGAGGATTTTCGGGTC
ATAAAGCAAAGAGTTTGGCAGACACTACAAGGATGGAAAGGTCTGAGGAAAAGGGTTGGTAATGGACTTGCTATTGATTTCTTTAAGGATCCATGGATCCCTAAG
GAGACAACATTCAAACCACTATGTAGACCAAATGAGGATGTTGATATAATTATCTCATCTATACCAATCAACAATTGTAGTGAGGAAGATAAATGGATATGGCAT
TATACTTCGCATGGCGAATACACAGTTAAAAGTGGATACAAGTTGTCCATGACTGATTCCGTTGGGCAAGGTTCATCTAGTTTGAATTCACTTAGGAAATGGTTC
GTTGTGAATGGATTCACAATTACTGGAATGAGGTTCGTGGTAATTCATAAACCCAATTCATCAATTATTGATAAGGTAGATTGTGGATTAACTGAAGAGTATAGA
GACATGTGTGTGTTCACTGATGCAGCAGTAAATACCAATAGTACTGAAACAGGCTATAGTATCCTGATTTTAGACCAAAATCACAAAATTTATGGGGCAATGAAA
ATGTTTGACAAAATGCCAATATCGCCTCTGGAGGAGAGGTTCAGGCCATTTTACAGGGACTCCGATTACTTCAAAGACTTAACATTTTGGGAGTACCTTTGTTCA
TAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTGACCGATCACACCCCAAGATGATTGGTGGGGACTTCAAACCTTTAACCTCGAAAGAGACCCTTAGGAAAGGATGTCGGCCGCTCCGCGTCGCGATGACG
TTGCAACAGCCTATAATGACATCCGAGGGGTCATGGACTACTGACCAAAAAGAAATGGAAAGATCTTTTGAAGAATATTTTCAAACTATCTTTTCGACATCTAAC
CCAAATGCAGAGGCTATTAATATGGTTGAGGTTGGTGATGTTACGACTCTTAATTTTTTAGATATTTTAAACTGTAAGTGCTCTGTTAGTGGGTGGAATTATACA
TATTTAGCTTTGATCCCCAAGACGGCTAATCCAAAGGTAGTGACAGATTTTCGACCTATAAGCTTATGCAATGTCAACTACAAAATCATTGCTAAGGTTTTGGTG
AACAGAATGAAAGGCGTCCTTAAGGATATTGTCTCGGAGAATCAATCTGCTTTTGTCCCAGGTCGCTCAATTCATGACAATGTTATTGTAAGCCATGAATGTTTA
CATACTCTGAAAAATAAAAGGCGTGGAAGAAAGGGGTTTATGGCCTTAAAATTTGATATGAGCAAGACCTATGATCGAGTGGAATGGACTTTCTTGGAACAGTTG
GTGCTGAAAATGGGATTTGATTGTCGTTGGGTCAATATTATAATGGACTGCGTGCGTTCAACTTTTTTCTCAATCTTGTTTAACGGCGTACCTATAGGAAATATT
GTTCCTCAGTGTGGACTGAGGCAGGGGAACCCCTTTCCCCATATTTGTTTATATTATGTGCAGAGGTGTTATCAGCACTTTTCTCTGAAGCATTGGCTAATCAAC
AGATCTCAGGAATTAGACCAGGTAAAAACTGCCCTACCATTTCCCATTTATTTTTTCGCTGATGATAGTTTGATTTTTGCAAAGGCAACGGTAGAGGAATGTTGG
ATGATTCGAAAATTACTTCGTGATTATGAAGCAACGTCAGACCAATCTATAAATTATAGGAAATCGGCGATGGTTGTTTCTCCAAATGTTTCATCTGATTGTCAA
GGATATCTTTCTGAGATCATGTCTATTTCAGTTGTCTCCAATTTGGGGAGATATCTCGACATTCCTTCTACATTTACTAGACGTAGAGGAGAGGATTTTCGGGTC
ATAAAGCAAAGAGTTTGGCAGACACTACAAGGATGGAAAGGTCTGAGGAAAAGGGTTGGTAATGGACTTGCTATTGATTTCTTTAAGGATCCATGGATCCCTAAG
GAGACAACATTCAAACCACTATGTAGACCAAATGAGGATGTTGATATAATTATCTCATCTATACCAATCAACAATTGTAGTGAGGAAGATAAATGGATATGGCAT
TATACTTCGCATGGCGAATACACAGTTAAAAGTGGATACAAGTTGTCCATGACTGATTCCGTTGGGCAAGGTTCATCTAGTTTGAATTCACTTAGGAAATGGTTC
GTTGTGAATGGATTCACAATTACTGGAATGAGGTTCGTGGTAATTCATAAACCCAATTCATCAATTATTGATAAGGTAGATTGTGGATTAACTGAAGAGTATAGA
GACATGTGTGTGTTCACTGATGCAGCAGTAAATACCAATAGTACTGAAACAGGCTATAGTATCCTGATTTTAGACCAAAATCACAAAATTTATGGGGCAATGAAA
ATGTTTGACAAAATGCCAATATCGCCTCTGGAGGAGAGGTTCAGGCCATTTTACAGGGACTCCGATTACTTCAAAGACTTAACATTTTGGGAGTACCTTTGTTCA
TAG
Protein sequenceShow/hide protein sequence
MGDRSHPKMIGGDFKPLTSKETLRKGCRPLRVAMTLQQPIMTSEGSWTTDQKEMERSFEEYFQTIFSTSNPNAEAINMVEVGDVTTLNFLDILNCKCSVSGWNYT
YLALIPKTANPKVVTDFRPISLCNVNYKIIAKVLVNRMKGVLKDIVSENQSAFVPGRSIHDNVIVSHECLHTLKNKRRGRKGFMALKFDMSKTYDRVEWTFLEQL
VLKMGFDCRWVNIIMDCVRSTFFSILFNGVPIGNIVPQCGLRQGNPFPHICLYYVQRCYQHFSLKHWLINRSQELDQVKTALPFPIYFFADDSLIFAKATVEECW
MIRKLLRDYEATSDQSINYRKSAMVVSPNVSSDCQGYLSEIMSISVVSNLGRYLDIPSTFTRRRGEDFRVIKQRVWQTLQGWKGLRKRVGNGLAIDFFKDPWIPK
ETTFKPLCRPNEDVDIIISSIPINNCSEEDKWIWHYTSHGEYTVKSGYKLSMTDSVGQGSSSLNSLRKWFVVNGFTITGMRFVVIHKPNSSIIDKVDCGLTEEYR
DMCVFTDAAVNTNSTETGYSILILDQNHKIYGAMKMFDKMPISPLEERFRPFYRDSDYFKDLTFWEYLCS