; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015720 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015720
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr12:21485151..21487390
RNA-Seq ExpressionLag0015720
SyntenyLag0015720
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU39028.1 hypothetical protein TSUD_59840 [Trifolium subterraneum]2.6e-8040.05Show/hide
Query:  MNCVESVTFSTLVNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDSLIFFEASEGEGRQIKKVLETY
        MNC+ +V++  LVNG P   FKP RG+RQGDPLSPY+F+LC   LS ++ +   N+ + GI++A+N+P +THL F DDSL+F  A++ E   I  +L+TY
Subjt:  MNCVESVTFSTLVNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDSLIFFEASEGEGRQIKKVLETY

Query:  EKATRQMINLDKSTCMLSQNVSPKKGLAICKAIGIK---------------------------QRVWKVLQGWKGKLFSQGGKEVLIKVVAQAIPTYTMT
        + A+ Q+++L+KS    S+N+   +   IC  I +K                           +R+WK ++GWK K  S+ GKE LIK VAQAIP Y M+
Subjt:  EKATRQMINLDKSTCMLSQNVSPKKGLAICKAIGIK---------------------------QRVWKVLQGWKGKLFSQGGKEVLIKVVAQAIPTYTMT

Query:  CFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLKAQLGANPSA
        C+KLP   CD+I  + A+FWWG++E ++KI+W+SW KL ++K +GGLGFR    FNKALL KQ WR+L+NP SLLAKV K +YFP   F+ A +G  PS 
Subjt:  CFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLKAQLGANPSA

Query:  TWRNILWGQFTVKRSYNLTVDINLKREASGSKEECTEALWKAVWNLNIYPRAKISIWKIVNDILPSA
         WR++         +    +D+  +      ++     +W   W   +  + K  +W  V++++P+A
Subjt:  TWRNILWGQFTVKRSYNLTVDINLKREASGSKEECTEALWKAVWNLNIYPRAKISIWKIVNDILPSA

PNY16580.1 ribonuclease H, partial [Trifolium pratense]1.4e-8140.6Show/hide
Query:  MNCVESVTFSTLVNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDSLIFFEASEGEGRQIKKVLETY
        MNC+ +V++  LVNG P   FKP RG+RQGDPLSPY+F+LC   LSG+L +   N+ + G+++A+N+P +THL F DDSL+F  A++ E   I  +L+TY
Subjt:  MNCVESVTFSTLVNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDSLIFFEASEGEGRQIKKVLETY

Query:  EKATRQMINLDKSTCMLSQNVSPKKGLAICKAIGIK---------------------------QRVWKVLQGWKGKLFSQGGKEVLIKVVAQAIPTYTMT
        + A+ Q+++L+KS    S+N+   +   IC  I +K                           +R+WK ++GWK K  S+ GKE LIK VAQAIP Y M+
Subjt:  EKATRQMINLDKSTCMLSQNVSPKKGLAICKAIGIK---------------------------QRVWKVLQGWKGKLFSQGGKEVLIKVVAQAIPTYTMT

Query:  CFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLKAQLGANPSA
        C+KLP   CD+I  + A+FWWG++E ++KI+W+SW KL ++K +GGLGFR    FNKALL KQ WR+L+NP SLLAKV K +YFP  +F+ A +G  PS 
Subjt:  CFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLKAQLGANPSA

Query:  TWRNILWGQFTVKRSYNLTVDINLKREASGSKEECTEALWKAVWNLNIYPRAKISIWKIVNDILPSA
         WR++         +    +D+  +      K+     +W   W   +  + K  +W  V+++ P+A
Subjt:  TWRNILWGQFTVKRSYNLTVDINLKREASGSKEECTEALWKAVWNLNIYPRAKISIWKIVNDILPSA

XP_023901742.1 uncharacterized protein LOC112013579 [Quercus suber]2.0e-8049.51Show/hide
Query:  MNCVESVTFSTLVNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDSLIFFEASEGEGRQIKKVLETY
        MNCV SV++S L+NG       P+RG+RQGDPLSP LFLLC EGLS L+     N  ++GI I +  P +THLFF DDSL+F +A E E   +  +L  Y
Subjt:  MNCVESVTFSTLVNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDSLIFFEASEGEGRQIKKVLETY

Query:  EKATRQMINLDKSTCMLSQNVSPK------------------KGLAICKAIG---------IKQRVWKVLQGWKGKLFSQGGKEVLIKVVAQAIPTYTMT
        E+A+ Q IN DKS+   S N S +                  K L +   IG         +K RV K L GWKGKL S GG+E+LIK VAQA+PTYTM+
Subjt:  EKATRQMINLDKSTCMLSQNVSPK------------------KGLAICKAIG---------IKQRVWKVLQGWKGKLFSQGGKEVLIKVVAQAIPTYTMT

Query:  CFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLKAQLGANPSA
        CF+LPK +C D+  L   FWWG  + + KI W+SW+K+CRSK  GG+GFR+I  FN A+LAKQ WRIL NP+SL+A+V K KYFP  + L ++ G+NPS 
Subjt:  CFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLKAQLGANPSA

Query:  TWRNI
         WR+I
Subjt:  TWRNI

XP_030496634.1 uncharacterized protein LOC115712492 [Cannabis sativa]2.6e-8047.57Show/hide
Query:  MNCVESVTFSTLVNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDSLIFFEASEGEGRQIKKVLETY
        M+C+ S +FS ++NG      KP RGLRQGDPLSPYLFL+C EGLS LL+ EES   L G+R+ +++P+++HL F DDSL+F EAS      +K+VLETY
Subjt:  MNCVESVTFSTLVNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDSLIFFEASEGEGRQIKKVLETY

Query:  EKATRQMINLDKSTCMLSQNVSPKKGLAICKAIG---------------------------IKQRVWKVLQGWKGKLFSQGGKEVLIKVVAQAIPTYTMT
          A+ Q++N  KS    S N S        + +G                           +K+R+W+ L  W  KLFS GGKEVL+K V Q+IPTY M+
Subjt:  EKATRQMINLDKSTCMLSQNVSPKKGLAICKAIG---------------------------IKQRVWKVLQGWKGKLFSQGGKEVLIKVVAQAIPTYTMT

Query:  CFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLKAQLGANPSA
        CF+LP   C  +  + A FWWGS++   KI+W SW+ LC+SKF GG+GFR    FNKALLAKQ+WRIL+ P+SLL+++LK +YF N  FL+A+LG +PS 
Subjt:  CFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLKAQLGANPSA

Query:  TWRNILWGQ
        TW+ I WG+
Subjt:  TWRNILWGQ

XP_030509188.1 uncharacterized protein LOC115723863 [Cannabis sativa]8.9e-8138.44Show/hide
Query:  MNCVESVTFSTLVNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDSLIFFEASEGEGRQIKKVLETY
        + C+ +VT+S L+NGS +    P+RG+RQGDPLSPYLFL+C EGLS LL+ EES   L G++I++N+P+++HLFF DDS++F  A+    R I + L+TY
Subjt:  MNCVESVTFSTLVNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDSLIFFEASEGEGRQIKKVLETY

Query:  EKATRQMINLDKSTCMLSQNVSPKK--------GLAI--C---------------KAI--GIKQRVWKVLQGWKGKLFSQGGKEVLIKVVAQAIPTYTMT
         +A+ Q+IN DK     S N    +        G+ I  C               K +  GI  ++WK+L  WK  LFS GGKE+L+K V QAIPTY M+
Subjt:  EKATRQMINLDKSTCMLSQNVSPKK--------GLAI--C---------------KAI--GIKQRVWKVLQGWKGKLFSQGGKEVLIKVVAQAIPTYTMT

Query:  CFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLKAQLGANPSA
        CF+LP  +C  I  + A FWWGSS   K I+W +W  LC++K +GGLGFR+   FN+ALLAKQ+WR+++ P+SLL+K+L+ +YF NG FL + LG+NPS 
Subjt:  CFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLKAQLGANPSA

Query:  TWRNI-------------------------------------------------LW-----GQFTVKRSYNLTVDINLKREASGSKEECTEALWKAVWNL
        TWR++                                                 +W     G + VK  Y L V    + + + S     E  W   W +
Subjt:  TWRNI-------------------------------------------------LW-----GQFTVKRSYNLTVDINLKREASGSKEECTEALWKAVWNL

Query:  NIYPRAKISIWKIVNDILPSASNL
         + P+ +I +WK+ +  LP A+ L
Subjt:  NIYPRAKISIWKIVNDILPSASNL

TrEMBL top hitse value%identityAlignment
A0A2N9G497 Reverse transcriptase domain-containing protein7.9e-8339.62Show/hide
Query:  MNCVESVTFSTLVNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDSLIFFEASEGEGRQIKKVLETY
        M CV++ ++S L+NG P    KP RG+RQGDPLSPYLFLLC EGLS LL R E    + G+ + +N P ++HL F DDSL+F +A+E E  ++ +VL  Y
Subjt:  MNCVESVTFSTLVNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDSLIFFEASEGEGRQIKKVLETY

Query:  EKATRQMINLDKSTCMLSQNVSPKKGLAICKAIGI---------------------------KQRVWKVLQGWKGKLFSQGGKEVLIKVVAQAIPTYTMT
        E+A+ QM+N +K+    S+N       AI +  G+                           K+R+ + LQGWK +L S+ G+ +LIK +AQAIPTYTM+
Subjt:  EKATRQMINLDKSTCMLSQNVSPKKGLAICKAIGI---------------------------KQRVWKVLQGWKGKLFSQGGKEVLIKVVAQAIPTYTMT

Query:  CFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLKAQLGANPSA
        CFKLPK  C DIN L + +WWG    + KI+WI+W +LC  K  GG+GFRDI  FN ALLAKQ WR++  P SL AK  + KYFP   FLKA+LG+NPS 
Subjt:  CFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLKAQLGANPSA

Query:  TWRNIL--------------------------WGQFT-------------------------------VKRSYNLTVDINLKREASGSKEEC-TEALWKA
         WR+IL                          WGQ T                               VK +Y + V  +L  EA GS ++   + +WK 
Subjt:  TWRNIL--------------------------WGQFT-------------------------------VKRSYNLTVDINLKREASGSKEEC-TEALWKA

Query:  VWNLNIYPRAKISIWKIVNDILPS
        +W L I  + K  IW+   + LP+
Subjt:  VWNLNIYPRAKISIWKIVNDILPS

A0A2N9GRT8 Reverse transcriptase domain-containing protein2.4e-8440.54Show/hide
Query:  MNCVESVTFSTLVNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDSLIFFEASEGEGRQIKKVLETY
        M C+ +V +S L++G P+    P RG+RQGDPLSPY+FLLC EGLS +L++   +  L G+++ +  P ++HLFF DDSL+F +A+  E  ++ ++L  Y
Subjt:  MNCVESVTFSTLVNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDSLIFFEASEGEGRQIKKVLETY

Query:  EKATRQMINLDKSTCMLSQNVSPKKGLAI-------------------------CKAI--GIKQRVWKVLQGWKGKLFSQGGKEVLIKVVAQAIPTYTMT
        E+++ Q IN+DK+    S N +    LAI                          K+I  G+K+R+ + LQGWK +  S+ G+E+LIK VAQAIPTY M 
Subjt:  EKATRQMINLDKSTCMLSQNVSPKKGLAI-------------------------CKAI--GIKQRVWKVLQGWKGKLFSQGGKEVLIKVVAQAIPTYTMT

Query:  CFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLKAQLGANPSA
        CF+LPK+ CD++N L AR+WWG  + ++K++WI W KLC +K  GGLGFR+++ FN ALLAKQ WRIL  P SL  +V K +YFPN  F+ AQLG+NPS 
Subjt:  CFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLKAQLGANPSA

Query:  TWRNILWGQ----------------------------FTVKRSYNLTVDINLKREASGSKEECTEA-----LWKAVWNLNIYPRAKISIWKIVNDILPSA
         WR+ LWG+                            FTVK +Y++ ++   KRE SG   EC+ +     LW+  W L I  + K  +W+  ++ LP+ 
Subjt:  TWRNILWGQ----------------------------FTVKRSYNLTVDINLKREASGSKEECTEA-----LWKAVWNLNIYPRAKISIWKIVNDILPSA

Query:  SNLKNDK
          L   K
Subjt:  SNLKNDK

A0A2N9HDH5 Uncharacterized protein1.6e-8341Show/hide
Query:  MNCVESVTFSTLVNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDSLIFFEASEGEGRQIKKVLETY
        M C+ +V +S +++G P+    P+RG+RQGDP+SPYLFLLC EGLS LLK+      L G++ ++  P ++HLFF DDSL+F +AS  E R+  ++L  Y
Subjt:  MNCVESVTFSTLVNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDSLIFFEASEGEGRQIKKVLETY

Query:  EKATRQMINLDKSTCMLSQNVSP------------------KKGLAICKAI---------GIKQRVWKVLQGWKGKLFSQGGKEVLIKVVAQAIPTYTMT
        E+++ Q +N +K+    S N S                    K L +   I         G+K+R+ + LQGWK K  S+ G+E+LIK VAQAIPTYTM 
Subjt:  EKATRQMINLDKSTCMLSQNVSP------------------KKGLAICKAI---------GIKQRVWKVLQGWKGKLFSQGGKEVLIKVVAQAIPTYTMT

Query:  CFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLKAQLGANPSA
        CF+LPK  CD++N L A++WWG ++ K+KI+W+ W KLC +K  GG+GFR++ +FN ALL+KQ WR+L+N  SL  +V K KYFP   FL AQLG++PS 
Subjt:  CFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLKAQLGANPSA

Query:  TWRNIL-----------------------W-----GQFTVKRSYNLTVDINLKREASG--SKEECTEALWKAVWNLNIYPRAKISIWKIVNDILPSASNL
         WR+ L                       W     G FTVK +Y L ++  ++ E+ G  S  +    LW+ VW L I  + K  IW+  +D LP++ NL
Subjt:  TWRNIL-----------------------W-----GQFTVKRSYNLTVDINLKREASG--SKEECTEALWKAVWNLNIYPRAKISIWKIVNDILPSASNL

A0A2N9HWG1 Reverse transcriptase domain-containing protein2.4e-8441.5Show/hide
Query:  MNCVESVTFSTLVNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDSLIFFEASEGEGRQIKKVLETY
        M C+ +V +S L++G P+    P+RG+RQGDP+SPYLFLLC EGLS LLK+      L G++I++  P ++HLFF DDSL+F +AS  E R+  ++L  Y
Subjt:  MNCVESVTFSTLVNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDSLIFFEASEGEGRQIKKVLETY

Query:  EKATRQMINLDKSTCMLSQNVSP------------------KKGLAICKAI---------GIKQRVWKVLQGWKGKLFSQGGKEVLIKVVAQAIPTYTMT
        E+++ Q +N +K+    S N S                    K L +   I         G+K+R+ + LQGWK K  S+ G+E+LIK VAQAIPTYTM 
Subjt:  EKATRQMINLDKSTCMLSQNVSP------------------KKGLAICKAI---------GIKQRVWKVLQGWKGKLFSQGGKEVLIKVVAQAIPTYTMT

Query:  CFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLKAQLGANPSA
        CF+LPK  CD++N L A++WWG ++ K+KI+W+ W KLC +K  GG+GFR++ +FN ALL+KQ WR+L+N  SL  +V K KYFP   FL AQLG++PS 
Subjt:  CFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLKAQLGANPSA

Query:  TWRNIL-----------------------W-----GQFTVKRSYNLTVDINLKREASG--SKEECTEALWKAVWNLNIYPRAKISIWKIVNDILPSASNL
         WR+ L                       W     G FTVK +Y L ++  ++ E+ G  S  +    LW+ VW L I  + K  IW+  +D LP++ NL
Subjt:  TWRNIL-----------------------W-----GQFTVKRSYNLTVDINLKREASG--SKEECTEALWKAVWNLNIYPRAKISIWKIVNDILPSASNL

A0A7N2L6Z9 Reverse transcriptase domain-containing protein5.4e-8451.15Show/hide
Query:  MNCVESVTFSTLVNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDSLIFFEASEGEGRQIKKVLETY
        M C+ SV++S ++NG       P RGLRQGDPLSPYLFLLC EGLS LL     N LL+GI + +  P +THLFF DDSL+F +A+  E  ++K++LE Y
Subjt:  MNCVESVTFSTLVNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDSLIFFEASEGEGRQIKKVLETY

Query:  EKATRQMINLDKSTCMLSQNVSPK------------------KGLAICKAIG---------IKQRVWKVLQGWKGKLFSQGGKEVLIKVVAQAIPTYTMT
        E A+ Q +N DKS+   S N +P+                  K L +   IG         IK+RV   L GWKGKL S GGKE+LIK VAQAIPTYTM+
Subjt:  EKATRQMINLDKSTCMLSQNVSPK------------------KGLAICKAIG---------IKQRVWKVLQGWKGKLFSQGGKEVLIKVVAQAIPTYTMT

Query:  CFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLKAQLGANPSA
        CF LPK++CD++ ++   FWWG    + K+ WISW+K+C+ K  GGLGFR++  FN ALLAKQ+WRIL NP SL A++LK KYFP G+ L A LG+NPS 
Subjt:  CFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLKAQLGANPSA

Query:  TWRNI
        TWR+I
Subjt:  TWRNI

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657505.3e-2036Show/hide
Query:  IKQRVWKVLQGWKGKLFSQGGKEVLIKVVAQAIPTYTMTCFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLA
        I +RV   + GW+ K  S  G+  L K V  ++P ++M+   LP++I + +++L   F WGS+  KKK + + W K+C  K  GGLG R     N+AL++
Subjt:  IKQRVWKVLQGWKGKLFSQGGKEVLIKVVAQAIPTYTMTCFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLA

Query:  KQSWRILKNPSSLLAKVLKGKYFPNGEFLKAQL---GANPSATWRNILWG
        K  WR+L+  +SL   VL+ KY   GE   ++      + S+TWR+I  G
Subjt:  KQSWRILKNPSSLLAKVLKGKYFPNGEFLKAQL---GANPSATWRNILWG

P11369 LINE-1 retrotransposable element ORF2 protein7.7e-1121.88Show/hide
Query:  VNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDSLIFFEASEGEGRQIKKVLETYEKATRQMINLDK
        VNG   E      G RQG PLSPYLF +  E L+  +++++    + GI+I K    ++ L   DD +++    +   R++  ++ ++ +     IN +K
Subjt:  VNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDSLIFFEASEGEGRQIKKVLETYEKATRQMINLDK

Query:  STCML-SQNVSPKK------------------GLAICKAI---------GIKQRVWKVLQGWKGKLFSQGGKEVLIK--VVAQAIPTYTMTCFKLPKNIC
        S   L ++N   +K                  G+ + K +          +K+ + + L+ WK    S  G+  ++K  ++ +AI  +     K+P    
Subjt:  STCML-SQNVSPKK------------------GLAICKAI---------GIKQRVWKVLQGWKGKLFSQGGKEVLIK--VVAQAIPTYTMTCFKLPKNIC

Query:  DDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLAKQSW
        +++     +F W + + +     I+   L   +  GG+   D+ L+ +A++ K +W
Subjt:  DDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLAKQSW

P92555 Uncharacterized mitochondrial protein AtMg012507.2e-1757.35Show/hide
Query:  LVNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDS
        ++NG+PQ    P+RGLRQGDPLSPYLF+LC E LSGL +R +    L GIR++ NSP + HL F DD+
Subjt:  LVNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDS

P93295 Uncharacterized mitochondrial protein AtMg003102.3e-3145.74Show/hide
Query:  AIPTYTMTCFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSK-FRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLK
        A+P Y M+CF+L K +C  +      FWW S E K+KI W++WQKLC+SK   GGLGFRD+  FN+ALLAKQS+RI+  P +LL+++L+ +YFP+   ++
Subjt:  AIPTYTMTCFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSK-FRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLK

Query:  AQLGANPSATWRNILWGQFTVKRSYNLTV
          +G  PS  WR+I+ G+  + R    T+
Subjt:  AQLGANPSATWRNILWGQFTVKRSYNLTV

Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein8.4e-2948.67Show/hide
Query:  AIPTYTMTCFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLKA
        A+PTYTM CF LPK +C  I  + A FWW + +  K ++W +W  L   K  GG+GF+DI  FN ALL KQ WR+L  P SL+AKV K +YF   + L A
Subjt:  AIPTYTMTCFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLKA

Query:  QLGANPSATWRNI
         LG+ PS  W++I
Subjt:  QLGANPSATWRNI

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.6e-3245.74Show/hide
Query:  AIPTYTMTCFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSK-FRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLK
        A+P Y M+CF+L K +C  +      FWW S E K+KI W++WQKLC+SK   GGLGFRD+  FN+ALLAKQS+RI+  P +LL+++L+ +YFP+   ++
Subjt:  AIPTYTMTCFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSK-FRGGLGFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLK

Query:  AQLGANPSATWRNILWGQFTVKRSYNLTV
          +G  PS  WR+I+ G+  + R    T+
Subjt:  AQLGANPSATWRNILWGQFTVKRSYNLTV

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)5.1e-1857.35Show/hide
Query:  LVNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDS
        ++NG+PQ    P+RGLRQGDPLSPYLF+LC E LSGL +R +    L GIR++ NSP + HL F DD+
Subjt:  LVNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTGTGTTGAGTCGGTGACTTTCTCGACACTGGTGAATGGTTCGCCTCAAGAGGAATTCAAACCAAACAGAGGCCTCAGGCAAGGAGACCCACTCTCCCCTTATCT
CTTTCTCCTTTGTGGGGAAGGGTTATCGGGCCTTTTGAAAAGGGAAGAATCCAACTCTCTCTTATCTGGTATTCGTATTGCTAAAAATAGTCCTACCCTAACTCACCTCT
TTTTTGTAGATGACAGTTTAATTTTTTTCGAAGCTTCAGAAGGAGAAGGAAGACAAATCAAGAAAGTGCTAGAGACTTATGAAAAAGCCACAAGGCAAATGATCAATCTA
GACAAGTCAACGTGTATGTTAAGCCAAAATGTTTCCCCGAAAAAAGGCTTAGCTATCTGCAAGGCAATTGGGATTAAGCAGAGGGTTTGGAAAGTCTTGCAAGGATGGAA
GGGCAAACTTTTTTCTCAAGGAGGAAAAGAAGTTCTCATTAAAGTCGTAGCGCAAGCAATTCCAACATACACTATGACTTGTTTTAAGCTCCCGAAGAACATTTGTGATG
ACATAAACAGATTGTGTGCACGGTTTTGGTGGGGTTCGTCGGAGGGTAAGAAAAAGATCTACTGGATTAGCTGGCAAAAGCTTTGCAGAAGTAAGTTTAGGGGAGGTTTG
GGTTTCAGGGATATAACTCTTTTCAACAAGGCCCTATTGGCTAAGCAAAGCTGGAGGATATTAAAAAATCCATCTAGCCTTTTAGCCAAGGTCCTAAAAGGGAAATATTT
CCCTAATGGCGAGTTCCTTAAAGCTCAACTCGGTGCCAATCCTTCAGCCACCTGGAGAAACATTCTGTGGGGCCAATTTACAGTAAAGAGGAGCTACAACCTGACAGTCG
ATATCAACTTGAAGAGGGAGGCTTCGGGGTCTAAGGAAGAATGCACAGAAGCTCTGTGGAAGGCGGTTTGGAACCTCAATATCTATCCAAGGGCAAAAATTTCCATTTGG
AAAATAGTGAATGACATTCTTCCTAGTGCCTCTAATCTAAAGAACGATAAGCAAAATCTAGGCGGAATCGGATGGTCCATTCGTGACTCAGAGGAGTATCTAATTGGAGT
AGGCTGCAAGCAATTTCCCAAGAGATGGTCTATAAAAAATTTGGAAGGAAAGGCAATTATGGAAGGGATGGAAGCTTACCAGTCCCTCATCAAAGGTTCCAGAGTCAAGA
TCCGGCTCGAGGTTGAAAGCGACTCGACGGAATTTTTGGCAGCGATCTCCGATTCGACGAACGATCTCTCGGAAACAAAGCTCCTTGTCCAGGCGATCAAGGAGCTATCA
TCCGAAATTAGAATCTCTTTCTCTTTCTGTCCAAGATCGAGAAACAAATTGGTCCATAATCTCGCTCGTGCTGTTGGCAAATTTGGGATTTTTTTTCATTTTTTTGTCCC
CTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATTGTGTTGAGTCGGTGACTTTCTCGACACTGGTGAATGGTTCGCCTCAAGAGGAATTCAAACCAAACAGAGGCCTCAGGCAAGGAGACCCACTCTCCCCTTATCT
CTTTCTCCTTTGTGGGGAAGGGTTATCGGGCCTTTTGAAAAGGGAAGAATCCAACTCTCTCTTATCTGGTATTCGTATTGCTAAAAATAGTCCTACCCTAACTCACCTCT
TTTTTGTAGATGACAGTTTAATTTTTTTCGAAGCTTCAGAAGGAGAAGGAAGACAAATCAAGAAAGTGCTAGAGACTTATGAAAAAGCCACAAGGCAAATGATCAATCTA
GACAAGTCAACGTGTATGTTAAGCCAAAATGTTTCCCCGAAAAAAGGCTTAGCTATCTGCAAGGCAATTGGGATTAAGCAGAGGGTTTGGAAAGTCTTGCAAGGATGGAA
GGGCAAACTTTTTTCTCAAGGAGGAAAAGAAGTTCTCATTAAAGTCGTAGCGCAAGCAATTCCAACATACACTATGACTTGTTTTAAGCTCCCGAAGAACATTTGTGATG
ACATAAACAGATTGTGTGCACGGTTTTGGTGGGGTTCGTCGGAGGGTAAGAAAAAGATCTACTGGATTAGCTGGCAAAAGCTTTGCAGAAGTAAGTTTAGGGGAGGTTTG
GGTTTCAGGGATATAACTCTTTTCAACAAGGCCCTATTGGCTAAGCAAAGCTGGAGGATATTAAAAAATCCATCTAGCCTTTTAGCCAAGGTCCTAAAAGGGAAATATTT
CCCTAATGGCGAGTTCCTTAAAGCTCAACTCGGTGCCAATCCTTCAGCCACCTGGAGAAACATTCTGTGGGGCCAATTTACAGTAAAGAGGAGCTACAACCTGACAGTCG
ATATCAACTTGAAGAGGGAGGCTTCGGGGTCTAAGGAAGAATGCACAGAAGCTCTGTGGAAGGCGGTTTGGAACCTCAATATCTATCCAAGGGCAAAAATTTCCATTTGG
AAAATAGTGAATGACATTCTTCCTAGTGCCTCTAATCTAAAGAACGATAAGCAAAATCTAGGCGGAATCGGATGGTCCATTCGTGACTCAGAGGAGTATCTAATTGGAGT
AGGCTGCAAGCAATTTCCCAAGAGATGGTCTATAAAAAATTTGGAAGGAAAGGCAATTATGGAAGGGATGGAAGCTTACCAGTCCCTCATCAAAGGTTCCAGAGTCAAGA
TCCGGCTCGAGGTTGAAAGCGACTCGACGGAATTTTTGGCAGCGATCTCCGATTCGACGAACGATCTCTCGGAAACAAAGCTCCTTGTCCAGGCGATCAAGGAGCTATCA
TCCGAAATTAGAATCTCTTTCTCTTTCTGTCCAAGATCGAGAAACAAATTGGTCCATAATCTCGCTCGTGCTGTTGGCAAATTTGGGATTTTTTTTCATTTTTTTGTCCC
CTGA
Protein sequenceShow/hide protein sequence
MNCVESVTFSTLVNGSPQEEFKPNRGLRQGDPLSPYLFLLCGEGLSGLLKREESNSLLSGIRIAKNSPTLTHLFFVDDSLIFFEASEGEGRQIKKVLETYEKATRQMINL
DKSTCMLSQNVSPKKGLAICKAIGIKQRVWKVLQGWKGKLFSQGGKEVLIKVVAQAIPTYTMTCFKLPKNICDDINRLCARFWWGSSEGKKKIYWISWQKLCRSKFRGGL
GFRDITLFNKALLAKQSWRILKNPSSLLAKVLKGKYFPNGEFLKAQLGANPSATWRNILWGQFTVKRSYNLTVDINLKREASGSKEECTEALWKAVWNLNIYPRAKISIW
KIVNDILPSASNLKNDKQNLGGIGWSIRDSEEYLIGVGCKQFPKRWSIKNLEGKAIMEGMEAYQSLIKGSRVKIRLEVESDSTEFLAAISDSTNDLSETKLLVQAIKELS
SEIRISFSFCPRSRNKLVHNLARAVGKFGIFFHFFVP