; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0017931 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0017931
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr5:11832931..11834932
RNA-Seq ExpressionLag0017931
SyntenyLag0017931
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0140640 - catalytic activity, acting on a nucleic acid (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAM93462.1 putative reverse transcriptase [Oryza sativa Japonica Group]1.0e-5338.3Show/hide
Query:  NKTYIAFIPKNQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSKLDMSKAYDRV
        N T I  IPK ++P  L+E R  SLCNV+YKII+K LANRLK +L  II  TQSAFV GRLITDN+++ +E  H + N++  K G+A  KLDMSKAYDRV
Subjt:  NKTYIAFIPKNQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSKLDMSKAYDRV

Query:  EWAFLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGEGI-------------------------------------KEVFDIYE
        EW FL   + +LGF E+W   IM CV +V + + +NG   E   P RGLRQG+ +                                     K++   YE
Subjt:  EWAFLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGEGI-------------------------------------KEVFDIYE

Query:  KAIEQKTNLAKSICMVNKNVDRSKAEDLSKLLGIKLTNSIGHYLRDLNDFNKAMLAKVSWRLIKNPDSLMAKM---------------------------
        +   Q  N  KS+ M + N+D  + +D  K++  K    +G   RDL+ FN AMLA+  WRLI+NPDSL +++                           
Subjt:  KAIEQKTNLAKSICMVNKNVDRSKAEDLSKLLGIKLTNSIGHYLRDLNDFNKAMLAKVSWRLIKNPDSLMAKM---------------------------

Query:  LRGRDLFAKGMRWKIGSGNKIIIDEDPWCKPKSADHLIWRKG
        L+G  L  KG+ W++G+G  I I  DPW  P+    +  R+G
Subjt:  LRGRDLFAKGMRWKIGSGNKIIIDEDPWCKPKSADHLIWRKG

KAA3453673.1 reverse transcriptase [Gossypium australe]1.5e-5240Show/hide
Query:  LNNGAEVGPLNKTYIAFIPKNQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSK
        LN   +V  +NKT I  +PK  +P  + +FR ISLCNVIYK+IAKA+AN+L+ +L   I   QS FVS RLITDN+++ +E +H + N+K  K G    K
Subjt:  LNNGAEVGPLNKTYIAFIPKNQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSK

Query:  LDMSKAYDRVEWAFLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGEGIKEVFDIYEKAIEQKTNLAKSICMVNKNVDRSKAED
        LDMS AYDRV+W F+ + M K+GF   W+D +M+CV SV +SV+IN         K   R    +K++   YEK  EQ  N  KS    + N    +   
Subjt:  LDMSKAYDRVEWAFLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGEGIKEVFDIYEKAIEQKTNLAKSICMVNKNVDRSKAED

Query:  LSKLLGIKLTNSIGHYLRDLNDFNKAMLAKVSWRLIKNPDSLMAKMLR---------------------------GRDLFAKGMRWKIGSGNKIIIDEDP
        +S+  G       G   R+L +FN A+LAK  WRLI  PDSL+AK+L+                            R L  KGM W++G G+KI I  D 
Subjt:  LSKLLGIKLTNSIGHYLRDLNDFNKAMLAKVSWRLIKNPDSLMAKMLR---------------------------GRDLFAKGMRWKIGSGNKIIIDEDP

Query:  WCKPKSADHL
        W   K AD +
Subjt:  WCKPKSADHL

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]3.0e-5332.32Show/hide
Query:  LNNGAEVGPLNKTYIAFIPKNQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSK
        LNNG ++   N TYIA IPK + P+ + +FR ISLCNV YKII+K++ NRLK ++G +IS  QSAFV  R I+DN+++G EC+H IN+ K   +G A  K
Subjt:  LNNGAEVGPLNKTYIAFIPKNQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSK

Query:  LDMSKAYDRVEWAFLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGE-------------------------------------
        LD+SKA+DRVEW +L   M K+GF+E WI  I++C+ +V+FS+ +NG P   F P RG+RQG+                                     
Subjt:  LDMSKAYDRVEWAFLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGE-------------------------------------

Query:  -----------------------GIKEVFDIYEKAIEQKTNLAKSICMVNKNVDRSKAEDLSKLLGIKLTNSIGHYL-----------------------
                                ++ + D Y +A  Q  N +KS  + + NV   + + L  +L +KL +  G+YL                       
Subjt:  -----------------------GIKEVFDIYEKAIEQKTNLAKSICMVNKNVDRSKAEDLSKLLGIKLTNSIGHYL-----------------------

Query:  ------------RDLNDFNKAMLAKVSWRLIKNPDSLMAKMLR---------------------------GRDLFAKGMRWKIGSGNKIIIDEDPW
                    RDL  FN+A++AK  WR +++P+ L++K+L+                           GRDL  KG+R ++G+G+ I    DPW
Subjt:  ------------RDLNDFNKAMLAKVSWRLIKNPDSLMAKMLR---------------------------GRDLFAKGMRWKIGSGNKIIIDEDPW

XP_024035599.1 uncharacterized protein LOC112096407 [Citrus clementina]5.0e-5642.72Show/hide
Query:  MLNNGAEVGPLNKTYIAFIPKNQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATS
        +LN    + PLN TYI  IPKN  P+++ E+R ISLCNVIY ++AKA+ANRLK  L  IIS  QSAFV  RLITDNI++G+EC+H I + K  K      
Subjt:  MLNNGAEVGPLNKTYIAFIPKNQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATS

Query:  KLDMSKAYDRVEWAFLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGEGIKEVFDIYEKAIEQKTNLAKSICMVNKNVDRSKAE
        KLD+ KAYDRVEW+FL+  +++LGFS +WI+ IM C+ +  FSV+ING      +P+RGLRQG  +     +   A E K           K +   K E
Subjt:  KLDMSKAYDRVEWAFLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGEGIKEVFDIYEKAIEQKTNLAKSICMVNKNVDRSKAE

Query:  DLSKLLGIKLTNSIGHYLRDLNDFNKAMLAKVSWRLIKNPDSLMAKMLR---------------------------GRDLFAKGMRWKIGSGNKIIIDED
         LS     K+   +G   RD++ FN+A+LAK  WRL++NP+SL+AK+++                           GR +  KG RW+IG+G KI I   
Subjt:  DLSKLLGIKLTNSIGHYLRDLNDFNKAMLAKVSWRLIKNPDSLMAKMLR---------------------------GRDLFAKGMRWKIGSGNKIIIDED

Query:  PW
         W
Subjt:  PW

XP_035551144.1 uncharacterized protein LOC118349708 [Juglans regia]9.7e-5242.75Show/hide
Query:  LNKTYIAFIPKNQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKK-SKVGWATSKLDMSKAYD
        LN T++  IPK + PK + +FR ISLCNV YKIIAK LANRLKK+L  +ISL QSAF+ GRLITDNI+  +E +H++  +KK    G    KLD+SKAYD
Subjt:  LNKTYIAFIPKNQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKK-SKVGWATSKLDMSKAYD

Query:  RVEWAFLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGEGIKEVFDIYEKAIEQ---------KTNLAKSICM-VNKNVDRSKA
        RVEW FLR  M KLGF E+W+  IM+CV++V +++++NG P  +  P RGLRQ   +    +I     E+         + N  K + + V KN +R + 
Subjt:  RVEWAFLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGEGIKEVFDIYEKAIEQ---------KTNLAKSICM-VNKNVDRSKA

Query:  EDLSKLLGIKLTNSIGHYLRDLNDFNKAMLAKVSWRLIKNPDSLMAKMLRGRDLFAKGMRWKIGSGNKIIIDEDPW
        +  +K+   K    +G   R+L  FN A+LAK  WR++  P S+ A+         +G+ W++G+G ++ I ED W
Subjt:  EDLSKLLGIKLTNSIGHYLRDLNDFNKAMLAKVSWRLIKNPDSLMAKMLRGRDLFAKGMRWKIGSGNKIIIDEDPW

TrEMBL top hitse value%identityAlignment
A0A2N9FTN8 Uncharacterized protein7.2e-5337.01Show/hide
Query:  LNNGAEVGPLNKTYIAFIPKNQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSK
        LN+G  +  +N TYI  IPK ++P+ ++EFR ISLCNVIYK+ +K LANRLKK+L  I+S +QSAFV GRLITDNI+V FE +H +++++K K G    K
Subjt:  LNNGAEVGPLNKTYIAFIPKNQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSK

Query:  LDMSKAYDRVEWAFLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGEG-IKEVFDIYEKAIEQKTNLAKSICMVNKNVDRSKAE
        LDMSKAYDRVEW +L++ M+K+GF  +W+  +M C+ +V +S+L+NG P     P R        I+ +   YE+A  Q+ N AK+    +K+  +    
Subjt:  LDMSKAYDRVEWAFLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGEG-IKEVFDIYEKAIEQKTNLAKSICMVNKNVDRSKAE

Query:  DLSKLLGI---------------------------------KLTNSIGHYLRDLNDFNKAMLAKVSWRLIKNPDSLMAK---------------------
         +  +LG+                                 K   + G  LR+L  FN+A+LAK  WRL+ N  SL  K                     
Subjt:  DLSKLLGI---------------------------------KLTNSIGHYLRDLNDFNKAMLAKVSWRLIKNPDSLMAK---------------------

Query:  ------MLRGRDLFAKGMRWKIGSGNKIIIDEDPW
              +L  ++L  KG+ W++G+G+K+ I ED W
Subjt:  ------MLRGRDLFAKGMRWKIGSGNKIIIDEDPW

A0A2N9IE82 Uncharacterized protein4.7e-5238.94Show/hide
Query:  LNNGAEVGPLNKTYIAFIPKNQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSK
        LN G  +  +N TYI  IPK Q P+ + +FR ISLCNVIYKII+K LANRLK +L  I+S +QSAFV GRLITDNI+V FE +H + ++ K  VG    K
Subjt:  LNNGAEVGPLNKTYIAFIPKNQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSK

Query:  LDMSKAYDRVEWAFLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGEGIK--------EVFDIYEKAIEQ------------KT
        LDMSKAYDRVEW +L++ M+++GF  +W+  +M C+ +V +S+L+NG P     P RGL QG+ +         E  ++  K++ Q             T
Subjt:  LDMSKAYDRVEWAFLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGEGIK--------EVFDIYEKAIEQ------------KT

Query:  NLAKSICMV------NKNVDRSKAEDLS--KLLGIKLTNSIGHYLRDLNDFNKAMLAKVSWRLIKNPDSLMAKMLR------------------------
         L K I ++       ++ +++K   LS   L   K    IG   R+L  FN+A+LAK  WRL+ NP SL  K+ +                        
Subjt:  NLAKSICMV------NKNVDRSKAEDLS--KLLGIKLTNSIGHYLRDLNDFNKAMLAKVSWRLIKNPDSLMAKMLR------------------------

Query:  ---GRDLFAKGMRWKIGSGNKIIIDEDPWCKPKSADHLI
            RD+  KG  W++G+G+ I I  D W  P    HLI
Subjt:  ---GRDLFAKGMRWKIGSGNKIIIDEDPWCKPKSADHLI

A0A5B6U8V5 Reverse transcriptase7.2e-5340Show/hide
Query:  LNNGAEVGPLNKTYIAFIPKNQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSK
        LN   +V  +NKT I  +PK  +P  + +FR ISLCNVIYK+IAKA+AN+L+ +L   I   QS FVS RLITDN+++ +E +H + N+K  K G    K
Subjt:  LNNGAEVGPLNKTYIAFIPKNQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSK

Query:  LDMSKAYDRVEWAFLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGEGIKEVFDIYEKAIEQKTNLAKSICMVNKNVDRSKAED
        LDMS AYDRV+W F+ + M K+GF   W+D +M+CV SV +SV+IN         K   R    +K++   YEK  EQ  N  KS    + N    +   
Subjt:  LDMSKAYDRVEWAFLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGEGIKEVFDIYEKAIEQKTNLAKSICMVNKNVDRSKAED

Query:  LSKLLGIKLTNSIGHYLRDLNDFNKAMLAKVSWRLIKNPDSLMAKMLR---------------------------GRDLFAKGMRWKIGSGNKIIIDEDP
        +S+  G       G   R+L +FN A+LAK  WRLI  PDSL+AK+L+                            R L  KGM W++G G+KI I  D 
Subjt:  LSKLLGIKLTNSIGHYLRDLNDFNKAMLAKVSWRLIKNPDSLMAKMLR---------------------------GRDLFAKGMRWKIGSGNKIIIDEDP

Query:  WCKPKSADHL
        W   K AD +
Subjt:  WCKPKSADHL

A0A6J1DX30 uncharacterized protein LOC1110248741.5e-5332.32Show/hide
Query:  LNNGAEVGPLNKTYIAFIPKNQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSK
        LNNG ++   N TYIA IPK + P+ + +FR ISLCNV YKII+K++ NRLK ++G +IS  QSAFV  R I+DN+++G EC+H IN+ K   +G A  K
Subjt:  LNNGAEVGPLNKTYIAFIPKNQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSK

Query:  LDMSKAYDRVEWAFLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGE-------------------------------------
        LD+SKA+DRVEW +L   M K+GF+E WI  I++C+ +V+FS+ +NG P   F P RG+RQG+                                     
Subjt:  LDMSKAYDRVEWAFLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGE-------------------------------------

Query:  -----------------------GIKEVFDIYEKAIEQKTNLAKSICMVNKNVDRSKAEDLSKLLGIKLTNSIGHYL-----------------------
                                ++ + D Y +A  Q  N +KS  + + NV   + + L  +L +KL +  G+YL                       
Subjt:  -----------------------GIKEVFDIYEKAIEQKTNLAKSICMVNKNVDRSKAEDLSKLLGIKLTNSIGHYL-----------------------

Query:  ------------RDLNDFNKAMLAKVSWRLIKNPDSLMAKMLR---------------------------GRDLFAKGMRWKIGSGNKIIIDEDPW
                    RDL  FN+A++AK  WR +++P+ L++K+L+                           GRDL  KG+R ++G+G+ I    DPW
Subjt:  ------------RDLNDFNKAMLAKVSWRLIKNPDSLMAKMLR---------------------------GRDLFAKGMRWKIGSGNKIIIDEDPW

Q8LMV3 Putative reverse transcriptase5.0e-5438.3Show/hide
Query:  NKTYIAFIPKNQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSKLDMSKAYDRV
        N T I  IPK ++P  L+E R  SLCNV+YKII+K LANRLK +L  II  TQSAFV GRLITDN+++ +E  H + N++  K G+A  KLDMSKAYDRV
Subjt:  NKTYIAFIPKNQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSKLDMSKAYDRV

Query:  EWAFLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGEGI-------------------------------------KEVFDIYE
        EW FL   + +LGF E+W   IM CV +V + + +NG   E   P RGLRQG+ +                                     K++   YE
Subjt:  EWAFLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGEGI-------------------------------------KEVFDIYE

Query:  KAIEQKTNLAKSICMVNKNVDRSKAEDLSKLLGIKLTNSIGHYLRDLNDFNKAMLAKVSWRLIKNPDSLMAKM---------------------------
        +   Q  N  KS+ M + N+D  + +D  K++  K    +G   RDL+ FN AMLA+  WRLI+NPDSL +++                           
Subjt:  KAIEQKTNLAKSICMVNKNVDRSKAEDLSKLLGIKLTNSIGHYLRDLNDFNKAMLAKVSWRLIKNPDSLMAKM---------------------------

Query:  LRGRDLFAKGMRWKIGSGNKIIIDEDPWCKPKSADHLIWRKG
        L+G  L  KG+ W++G+G  I I  DPW  P+    +  R+G
Subjt:  LRGRDLFAKGMRWKIGSGNKIIIDEDPWCKPKSADHLIWRKG

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein4.6e-1229.33Show/hide
Query:  IAFIPK-NQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSKLDMSKAYDRVEWA
        I  IPK  +   K E FR ISL N+  KI+ K LANR+++ +  +I   Q  F+ G     NI      I  I NR K K       +D  KA+D+++  
Subjt:  IAFIPK-NQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSKLDMSKAYDRVEWA

Query:  FLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGEGIKE-----VFDIYEKAIEQKTNLAKSICMVNKNVDRSK-AEDLSKLLGI
        F+ + ++KLG    ++  I    +    ++++NG   EAF  K G RQG  +       V ++  +AI Q+  + K I +  + V  S  A+D+   L  
Subjt:  FLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGEGIKE-----VFDIYEKAIEQKTNLAKSICMVNKNVDRSK-AEDLSKLLGI

Query:  KLTNSIGHYLRDLNDFNKAMLAKVS
         +  S  + L+ +++F+K    K++
Subjt:  KLTNSIGHYLRDLNDFNKAMLAKVS

P08548 LINE-1 reverse transcriptase homolog9.5e-1029.53Show/hide
Query:  IAFIPK-NQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSKLDMSKAYDRVEWA
        I  IPK  + P + E +R ISL N+  KI+ K L NR+++ +  II   Q  F+ G     NI      I  IN  K          +D  KA+D ++  
Subjt:  IAFIPK-NQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSKLDMSKAYDRVEWA

Query:  FLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQG
        F+ + + K+G    ++  I         ++++NGV  ++F  + G RQG
Subjt:  FLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQG

P11369 LINE-1 retrotransposable element ORF2 protein2.4e-1329.02Show/hide
Query:  IAFIPKNQA-PKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSKLDMSKAYDRVEWA
        I  IPK Q  P K+E FR ISL N+  KI+ K LANR+++ +  II   Q  F+ G     NI      IH IN  K          LD  KA+D+++  
Subjt:  IAFIPKNQA-PKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSKLDMSKAYDRVEWA

Query:  FLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGEGIKE-----VFDIYEKAIEQKTNLAKSICMVNKNVDRSKAEDLSKLLGIK
        F+ + +++ G    +++ I         ++ +NG   EA   K G RQG  +       V ++  +AI Q+  + K I +  + V  S   D   +    
Subjt:  FLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGEGIKE-----VFDIYEKAIEQKTNLAKSICMVNKNVDRSKAEDLSKLLGIK

Query:  LTNSIGHYLRDLNDFNKAMLAKVS
          NS    L  +N F + +  K++
Subjt:  LTNSIGHYLRDLNDFNKAMLAKVS

P14381 Transposon TX1 uncharacterized 149 kDa protein6.6e-1128.92Show/hide
Query:  KTYIAFIPKNQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSKLDMSKAYDRVE
        +  ++ +PK    + ++ +R +SL +  YKI+AKA++ RLK +L  +I   QS  V GR I DN+ +  + +H     +++ +  A   LD  KA+DRV+
Subjt:  KTYIAFIPKNQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSKLDMSKAYDRVE

Query:  WAFLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGEGIKEVFDIYEKAIE
          +L   +    F  +++  +     S +  V IN          RG+RQG  +     +Y  AIE
Subjt:  WAFLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGEGIKEVFDIYEKAIE

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases5.0e-1440.91Show/hide
Query:  LANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSKLDMSKAYDRVEWAFLRQAMDKLGFSERWIDRIMR
        +  RLK ++  +I   Q++F+ GR+ TDNIV   E +H++  RKK   GW   KLD+ KAYDR+ W +L   +   GF E W+  I R
Subjt:  LANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSKLDMSKAYDRVEWAFLRQAMDKLGFSERWIDRIMR

AT4G29090.1 Ribonuclease H-like superfamily protein7.3e-0525Show/hide
Query:  RDLNDFNKAMLAKVSWRLIKNPDSLMAKMLRGR---------------------------DLFAKGMRWKIGSGNKIIIDEDPWCKPKSADHLIWRKGWE
        +D+  FN A+L K  WR++  P+SLMAK+ + R                           ++  +G R  +G+G  III    W   K A   +  +   
Subjt:  RDLNDFNKAMLAKVSWRLIKNPDSLMAKMLRGR---------------------------DLFAKGMRWKIGSGNKIIIDEDPWCKPKSADHLIWRKGWE

Query:  PMDF
        P ++
Subjt:  PMDF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGAACAATGGAGCGGAGGTGGGCCCCCTAAACAAGACCTATATCGCCTTCATTCCCAAAAATCAAGCTCCTAAAAAGCTAGAAGAATTCAGACTTATCAGCTTGTG
CAACGTCATTTACAAAATCATTGCAAAAGCACTAGCGAACAGGTTAAAGAAAATGCTCGGGACAATCATTTCGCTAACCCAATCAGCTTTTGTGTCGGGTAGGTTGATCA
CGGACAACATAGTAGTGGGATTTGAATGCATCCACGCTATTAATAATAGGAAGAAGAGTAAAGTTGGTTGGGCGACTTCGAAACTCGACATGAGCAAAGCTTATGACAGA
GTAGAGTGGGCCTTTCTCAGACAAGCTATGGACAAGCTAGGTTTCTCAGAGAGATGGATAGATAGAATTATGAGATGTGTGGAATCGGTTCAGTTCTCTGTTCTCATTAA
CGGAGTGCCCCAAGAAGCTTTCAATCCTAAAAGGGGGTTAAGGCAAGGGGAAGGGATAAAGGAGGTGTTTGATATCTATGAGAAGGCAATTGAGCAAAAGACCAATTTAG
CAAAGTCCATTTGTATGGTCAACAAGAATGTGGATCGATCGAAGGCGGAGGATCTAAGCAAGCTTCTGGGGATAAAGCTTACGAATTCTATAGGACACTACCTCAGAGAT
TTGAACGATTTCAACAAAGCAATGTTGGCTAAAGTGAGTTGGAGGTTGATCAAGAATCCCGATAGTTTGATGGCCAAAATGCTAAGAGGAAGAGATCTGTTTGCTAAGGG
GATGAGATGGAAAATAGGTTCAGGAAACAAGATTATTATCGATGAAGACCCTTGGTGTAAGCCGAAGTCTGCAGATCACCTAATCTGGAGGAAAGGATGGGAGCCAATGG
ATTTTTGGGAAGGATTGAATCGAAGTCTAGATGAAACAGACATAAACAAAGTTGTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTGAACAATGGAGCGGAGGTGGGCCCCCTAAACAAGACCTATATCGCCTTCATTCCCAAAAATCAAGCTCCTAAAAAGCTAGAAGAATTCAGACTTATCAGCTTGTG
CAACGTCATTTACAAAATCATTGCAAAAGCACTAGCGAACAGGTTAAAGAAAATGCTCGGGACAATCATTTCGCTAACCCAATCAGCTTTTGTGTCGGGTAGGTTGATCA
CGGACAACATAGTAGTGGGATTTGAATGCATCCACGCTATTAATAATAGGAAGAAGAGTAAAGTTGGTTGGGCGACTTCGAAACTCGACATGAGCAAAGCTTATGACAGA
GTAGAGTGGGCCTTTCTCAGACAAGCTATGGACAAGCTAGGTTTCTCAGAGAGATGGATAGATAGAATTATGAGATGTGTGGAATCGGTTCAGTTCTCTGTTCTCATTAA
CGGAGTGCCCCAAGAAGCTTTCAATCCTAAAAGGGGGTTAAGGCAAGGGGAAGGGATAAAGGAGGTGTTTGATATCTATGAGAAGGCAATTGAGCAAAAGACCAATTTAG
CAAAGTCCATTTGTATGGTCAACAAGAATGTGGATCGATCGAAGGCGGAGGATCTAAGCAAGCTTCTGGGGATAAAGCTTACGAATTCTATAGGACACTACCTCAGAGAT
TTGAACGATTTCAACAAAGCAATGTTGGCTAAAGTGAGTTGGAGGTTGATCAAGAATCCCGATAGTTTGATGGCCAAAATGCTAAGAGGAAGAGATCTGTTTGCTAAGGG
GATGAGATGGAAAATAGGTTCAGGAAACAAGATTATTATCGATGAAGACCCTTGGTGTAAGCCGAAGTCTGCAGATCACCTAATCTGGAGGAAAGGATGGGAGCCAATGG
ATTTTTGGGAAGGATTGAATCGAAGTCTAGATGAAACAGACATAAACAAAGTTGTCTGA
Protein sequenceShow/hide protein sequence
MLNNGAEVGPLNKTYIAFIPKNQAPKKLEEFRLISLCNVIYKIIAKALANRLKKMLGTIISLTQSAFVSGRLITDNIVVGFECIHAINNRKKSKVGWATSKLDMSKAYDR
VEWAFLRQAMDKLGFSERWIDRIMRCVESVQFSVLINGVPQEAFNPKRGLRQGEGIKEVFDIYEKAIEQKTNLAKSICMVNKNVDRSKAEDLSKLLGIKLTNSIGHYLRD
LNDFNKAMLAKVSWRLIKNPDSLMAKMLRGRDLFAKGMRWKIGSGNKIIIDEDPWCKPKSADHLIWRKGWEPMDFWEGLNRSLDETDINKVV