; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0009464 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0009464
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr9:39482616..39483650
RNA-Seq ExpressionLag0009464
SyntenyLag0009464
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PKA56961.1 Putative ribonuclease H protein [Apostasia shenzhenica]1.8e-6841.94Show/hide
Query:  ALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKVRNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSLIVQ
        AL QIHP+KAPGPDG   +F+++ W ++GEDV++  L +LN  +S   +N T IVLIPK      +  FR ISLCNV YK+I+K L N++K +LSS+I  
Subjt:  ALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKVRNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSLIVQ

Query:  NQSAFIPGRCVVDNVIV-------------------------------------------------------GCISSIRFSFNFNGIRCGDVKPSQGLRQ
        NQSAF+P R + DN+IV                                                        CI+++ ++ +FNG   G++ P +GLRQ
Subjt:  NQSAFIPGRCVVDNVIV-------------------------------------------------------GCISSIRFSFNFNGIRCGDVKPSQGLRQ

Query:  GDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDDQTQGSV
        GDPLSPY FL+C EGLS++++  E +R + G+R SRR P ++HLFFADDSLLFFRARVEEA+ IQ I++ YEKAS Q +N+DKS + FS NT  + +  +
Subjt:  GDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDDQTQGSV

Query:  NGVLHVRSTPCHQHYLGLPTFMPRSRVSSMKFIKDRVWRKI
          +L+V+       YLGLP  + R++  +   IKDR+ ++I
Subjt:  NGVLHVRSTPCHQHYLGLPTFMPRSRVSSMKFIKDRVWRKI

XP_010682933.1 PREDICTED: uncharacterized protein LOC104897695 [Beta vulgaris subsp. vulgaris]7.3e-7042.23Show/hide
Query:  ALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKVRNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSLIVQ
        AL Q+HPNKAPG DGM   FY++ W +VG+D++       N +   G LN T IVLIPK  NP+++ +FR ISLC V YK++SK++ N++K  LS LI  
Subjt:  ALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKVRNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSLIVQ

Query:  NQSAFIPGRCVVDNV-------------------------------------------------------IVGCISSIRFSFNFNGIRCGDVKPSQGLRQ
        +QSAF+PGR + DN                                                        I+ C+SS+ +SF  NG   G++ PS+GLRQ
Subjt:  NQSAFIPGRCVVDNV-------------------------------------------------------IVGCISSIRFSFNFNGIRCGDVKPSQGLRQ

Query:  GDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDDQTQGSV
        GDPLSPYLFLLC E  SA+L  A     I G R+ R  P +SHLFFADDS+LF RA ++E  V+ +IL  YE+AS Q +NFDKS V+FS N DD  +  +
Subjt:  GDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDDQTQGSV

Query:  NGVLHVRSTPCHQHYLGLPTFMPRSRVSSMKFIKDRVWRKI
          +  VR    H+ YLGLPT + RS+      +K+RVW+K+
Subjt:  NGVLHVRSTPCHQHYLGLPTFMPRSRVSSMKFIKDRVWRKI

XP_015386480.1 uncharacterized protein LOC107177332 [Citrus sinensis]2.4e-6840.12Show/hide
Query:  MSGALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKVRNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSL
        +  A+  +HP+K+ GPDGM  AFY+Q W+VVG+DV   CL  + +++ P GLN T IVL+PK++NP ++ + R I+LCNV YK+I+K+L N++K +L  +
Subjt:  MSGALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKVRNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSL

Query:  IVQNQSAFIPGRCVVDNVIVG-------------------------------------------------------CISSIRFSFNFNGIRCGDVKPSQG
        I ++QSAF+PGR + DNV++                                                        C+S + +S  ++    G + PS+G
Subjt:  IVQNQSAFIPGRCVVDNVIVG-------------------------------------------------------CISSIRFSFNFNGIRCGDVKPSQG

Query:  LRQGDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDDQTQ
        LRQGD LSPYLFLLC EGLS ++ + E S  + G RI      +SHLFFADD  LFFRA  EEA++I++IL  Y  AS Q VNF+KS+++FS NT   + 
Subjt:  LRQGDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDDQTQ

Query:  GSVNGVLHVRSTPCHQHYLGLPTFMPRSRVSSMKFIKDRVWRKI
         SV  +L +++T  H  YLGLP+ + R++  + +FIKDRVW ++
Subjt:  GSVNGVLHVRSTPCHQHYLGLPTFMPRSRVSSMKFIKDRVWRKI

XP_027118730.1 uncharacterized protein LOC113735973 [Coffea arabica]1.9e-7342.82Show/hide
Query:  ALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKVRNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSLIVQ
        AL+Q+HP K+PGPDGM   FY++ W +VG DV R  L VLN    P  LN T +VLIPK  +P  +++FR ISLCNV YKL SK+L N+++  L  +I  
Subjt:  ALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKVRNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSLIVQ

Query:  NQSAFIPGRCVVDNVIVG-------------------------------------------------------CISSIRFSFNFNGIRCGDVKPSQGLRQ
         QSAFIPGRC+ DN+++                                                        CIS++ ++F  NG + G+++P +G+RQ
Subjt:  NQSAFIPGRCVVDNVIVG-------------------------------------------------------CISSIRFSFNFNGIRCGDVKPSQGLRQ

Query:  GDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDDQTQGSV
        GDPLSPYLFL+C EG S +L +A+   SIS  RI  RGP LSHL FADD+LLF +A ++EA+ I+ IL  Y+ AS Q VNFDKS V FS NTD   + S+
Subjt:  GDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDDQTQGSV

Query:  NGVLHVRSTPCHQHYLGLPTFMPRSRVSSMKFIKDRVWRKI
          +L+++  P H  YLGLPT +  S+      +K+R+W+KI
Subjt:  NGVLHVRSTPCHQHYLGLPTFMPRSRVSSMKFIKDRVWRKI

XP_028090832.1 uncharacterized protein LOC114291041 [Camellia sinensis]1.8e-6843.7Show/hide
Query:  ALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKVRNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSLIVQ
        AL Q+HP KAPGPDG    F+++ W+VVG  V    LGVLN  +    +N T IVLIPK++NP+ +SEFR ISLCNV YK+ISK+L N++K +L S+I  
Subjt:  ALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKVRNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSLIVQ

Query:  NQSAFIPGRCVVDNVIV-------------------------------------------------------GCISSIRFSFNFNGIRCGDVKPSQGLRQ
         QSAFIPGR + DN +V                                                        C+S+I FS   NG       P++GLRQ
Subjt:  NQSAFIPGRCVVDNVIV-------------------------------------------------------GCISSIRFSFNFNGIRCGDVKPSQGLRQ

Query:  GDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDDQTQGSV
        GDPLSPYLF +C EG+SA+L  A A+  +SG+ I R  P LSHLFFA+DSLLF  A+ +E  V+++ILQ YE+AS Q +NF KS V FS NTD  TQ  V
Subjt:  GDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDDQTQGSV

Query:  NGVLHVRSTPCHQHYLGLPTFMPRSRVSSMKFIKDRVWRKI
          +L V S   +  YLGLP  +  S+ + + F+K+RVW ++
Subjt:  NGVLHVRSTPCHQHYLGLPTFMPRSRVSSMKFIKDRVWRKI

TrEMBL top hitse value%identityAlignment
A0A2N9GB96 Uncharacterized protein7.1e-7142.82Show/hide
Query:  ALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKVRNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSLIVQ
        AL Q+ P KAPGPDGM  AFY+  W+VVG++V++  L  +N+   P  +N T + LIPKV+NP  V+E+R ISLCNV YKLISKVL N++K +L ++I +
Subjt:  ALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKVRNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSLIVQ

Query:  NQSAFIPGRCVVDNVIVG-------------------------------------------------------CISSIRFSFNFNGIRCGDVKPSQGLRQ
         QSAF+PGR + DNV++                                                        CI+++ +S   NG   G + PS+GLRQ
Subjt:  NQSAFIPGRCVVDNVIVG-------------------------------------------------------CISSIRFSFNFNGIRCGDVKPSQGLRQ

Query:  GDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDDQTQGSV
        GDP+SPYLFLLC EGL+ +L+ A A   I G+ + RRGP L+HLFFADDSLLF RA   E   IQ++L  YEKAS Q +N  K+T+ FS NT   TQ  +
Subjt:  GDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDDQTQGSV

Query:  NGVLHVRSTPCHQHYLGLPTFMPRSRVSSMKFIKDRVWRKI
          +L V S   ++ YLGLP+ + + +++    IKDRVW K+
Subjt:  NGVLHVRSTPCHQHYLGLPTFMPRSRVSSMKFIKDRVWRKI

A0A2N9GLG8 Reverse transcriptase domain-containing protein7.1e-7142.82Show/hide
Query:  ALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKVRNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSLIVQ
        AL Q+ P KAPGPDGM  AFY+  W+VVG++V++  L  +N+   P  +N T + LIPKV+NP  V+E+R ISLCNV YKLISKVL N++K +L ++I +
Subjt:  ALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKVRNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSLIVQ

Query:  NQSAFIPGRCVVDNVIVG-------------------------------------------------------CISSIRFSFNFNGIRCGDVKPSQGLRQ
         QSAF+PGR + DNV++                                                        CI+++ +S   NG   G + PS+GLRQ
Subjt:  NQSAFIPGRCVVDNVIVG-------------------------------------------------------CISSIRFSFNFNGIRCGDVKPSQGLRQ

Query:  GDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDDQTQGSV
        GDP+SPYLFLLC EGL+ +L+ A A   I G+ + RRGP L+HLFFADDSLLF RA   E   IQ++L  YEKAS Q +N  K+T+ FS NT   TQ  +
Subjt:  GDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDDQTQGSV

Query:  NGVLHVRSTPCHQHYLGLPTFMPRSRVSSMKFIKDRVWRKI
          +L V S   ++ YLGLP+ + + +++    IKDRVW K+
Subjt:  NGVLHVRSTPCHQHYLGLPTFMPRSRVSSMKFIKDRVWRKI

A0A2N9GYM1 Reverse transcriptase domain-containing protein7.1e-7142.82Show/hide
Query:  ALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKVRNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSLIVQ
        AL Q+ P KAPGPDGM  AFY+  W+VVG++V++  L  +N+   P  +N T + LIPKV+NP  V+E+R ISLCNV YKLISKVL N++K +L ++I +
Subjt:  ALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKVRNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSLIVQ

Query:  NQSAFIPGRCVVDNVIVG-------------------------------------------------------CISSIRFSFNFNGIRCGDVKPSQGLRQ
         QSAF+PGR + DNV++                                                        CI+++ +S   NG   G + PS+GLRQ
Subjt:  NQSAFIPGRCVVDNVIVG-------------------------------------------------------CISSIRFSFNFNGIRCGDVKPSQGLRQ

Query:  GDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDDQTQGSV
        GDP+SPYLFLLC EGL+ +L+ A A   I G+ + RRGP L+HLFFADDSLLF RA   E   IQ++L  YEKAS Q +N  K+T+ FS NT   TQ  +
Subjt:  GDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDDQTQGSV

Query:  NGVLHVRSTPCHQHYLGLPTFMPRSRVSSMKFIKDRVWRKI
          +L V S   ++ YLGLP+ + + +++    IKDRVW K+
Subjt:  NGVLHVRSTPCHQHYLGLPTFMPRSRVSSMKFIKDRVWRKI

A0A2N9H0J9 Uncharacterized protein7.1e-7142.82Show/hide
Query:  ALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKVRNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSLIVQ
        AL Q+ P KAPGPDGM  AFY+  W+VVG++V++  L  +N+   P  +N T + LIPKV+NP  V+E+R ISLCNV YKLISKVL N++K +L ++I +
Subjt:  ALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKVRNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSLIVQ

Query:  NQSAFIPGRCVVDNVIVG-------------------------------------------------------CISSIRFSFNFNGIRCGDVKPSQGLRQ
         QSAF+PGR + DNV++                                                        CI+++ +S   NG   G + PS+GLRQ
Subjt:  NQSAFIPGRCVVDNVIVG-------------------------------------------------------CISSIRFSFNFNGIRCGDVKPSQGLRQ

Query:  GDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDDQTQGSV
        GDP+SPYLFLLC EGL+ +L+ A A   I G+ + RRGP L+HLFFADDSLLF RA   E   IQ++L  YEKAS Q +N  K+T+ FS NT   TQ  +
Subjt:  GDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDDQTQGSV

Query:  NGVLHVRSTPCHQHYLGLPTFMPRSRVSSMKFIKDRVWRKI
          +L V S   ++ YLGLP+ + + +++    IKDRVW K+
Subjt:  NGVLHVRSTPCHQHYLGLPTFMPRSRVSSMKFIKDRVWRKI

A0A6P6WTP1 uncharacterized protein LOC1137359739.0e-7442.82Show/hide
Query:  ALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKVRNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSLIVQ
        AL+Q+HP K+PGPDGM   FY++ W +VG DV R  L VLN    P  LN T +VLIPK  +P  +++FR ISLCNV YKL SK+L N+++  L  +I  
Subjt:  ALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKVRNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSLIVQ

Query:  NQSAFIPGRCVVDNVIVG-------------------------------------------------------CISSIRFSFNFNGIRCGDVKPSQGLRQ
         QSAFIPGRC+ DN+++                                                        CIS++ ++F  NG + G+++P +G+RQ
Subjt:  NQSAFIPGRCVVDNVIVG-------------------------------------------------------CISSIRFSFNFNGIRCGDVKPSQGLRQ

Query:  GDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDDQTQGSV
        GDPLSPYLFL+C EG S +L +A+   SIS  RI  RGP LSHL FADD+LLF +A ++EA+ I+ IL  Y+ AS Q VNFDKS V FS NTD   + S+
Subjt:  GDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDDQTQGSV

Query:  NGVLHVRSTPCHQHYLGLPTFMPRSRVSSMKFIKDRVWRKI
          +L+++  P H  YLGLPT +  S+      +K+R+W+KI
Subjt:  NGVLHVRSTPCHQHYLGLPTFMPRSRVSSMKFIKDRVWRKI

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.1e-1524.27Show/hide
Query:  KAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKV-RNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSLIVQNQSAFIP
        K+PGPDG    FY++  E +   +++    +      P    E  I+LIPK  R+  +   FR ISL N+  K+++K+L N+++  +  LI  +Q  FIP
Subjt:  KAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKV-RNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSLIVQNQSAFIP

Query:  G-------------------------------------------------RCVVDNVIVGCISSI----RFSFNFNGIRCGDVKPSQGLRQGDPLSPYLF
        G                                                 +  +D + +  I +I      +   NG +        G RQG PLSP LF
Subjt:  G-------------------------------------------------RCVVDNVIVGCISSI----RFSFNFNGIRCGDVKPSQGLRQGDPLSPYLF

Query:  LLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDDQTQGSVNGVLHVRST
         + +E L+  +      + I G+++ +    LS   FADD +++    +  A+ +  ++  + K S   +N  KS  AF  N + QT+  + G L     
Subjt:  LLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDDQTQGSVNGVLHVRST

Query:  PCHQHYLGL
             YLG+
Subjt:  PCHQHYLGL

P08548 LINE-1 reverse transcriptase homolog6.3e-1624.38Show/hide
Query:  MSGALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETM----IVLIPKV-RNPRRVSEFRLISLCNVAYKLISKVLVNKMKG
        ++  +  +   K+PGPDG    FY    +   E+++   L +  N E  G L  T     I LIPK  ++P R   +R ISL N+  K+++K+L N+++ 
Subjt:  MSGALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETM----IVLIPKV-RNPRRVSEFRLISLCNVAYKLISKVLVNKMKG

Query:  LLSSLIVQNQSAFIPG--------------------------------RCVVDNV----IVGCISSIRFSFNF-----------------NGIRCGDVKP
         +  +I  +Q  FIPG                                    DN+    ++  +  I     F                 NG++      
Subjt:  LLSSLIVQNQSAFIPG--------------------------------RCVVDNV----IVGCISSIRFSFNF-----------------NGIRCGDVKP

Query:  SQGLRQGDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDD
          G RQG PLSP LF + +E L+  + + +A   I G+ I      LS   FADD +++     +    +  +++ Y   S   +N  KS VAF    ++
Subjt:  SQGLRQGDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDD

Query:  QTQGSVNGVLHVRSTPCHQHYLGL
        Q + +V   +     P    YLG+
Subjt:  QTQGSVNGVLHVRSTPCHQHYLGL

P11369 LINE-1 retrotransposable element ORF2 protein3.2e-1222.68Show/hide
Query:  KAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKE----SPGGLNETMIVLIPK-VRNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSLIVQNQS
        K+PGPDG    FY    +   ED++     + +  E     P    E  I LIPK  ++P ++  FR ISL N+  K+++K+L N+++  + ++I  +Q 
Subjt:  KAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKE----SPGGLNETMIVLIPK-VRNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSLIVQNQS

Query:  AFIPG------------------------RCVVD-----------------------------NVIVGCISSIRFSFNFNGIRCGDVKPSQGLRQGDPLS
         FIPG                          ++                              N+I    S    +   NG +   +    G RQG PLS
Subjt:  AFIPG------------------------RCVVD-----------------------------NVIVGCISSIRFSFNFNGIRCGDVKPSQGLRQGDPLS

Query:  PYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDDQTQGSVNGVLH
        PYLF + +E L+  +      + I G++I +    +S L  ADD +++        + + N++  + +     +N +KS +AF    + Q +  +     
Subjt:  PYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKSTVAFSPNTDDQTQGSVNGVLH

Query:  VRSTPCHQHYLGL
              +  YLG+
Subjt:  VRSTPCHQHYLGL

P14381 Transposon TX1 uncharacterized 149 kDa protein1.6e-1423.86Show/hide
Query:  MSGALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKVRNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSL
        +S AL  +  NK+PG DG+   F++  W+ +G D  R         E P      ++ L+PK  + R +  +R +SL +  YK+++K +  ++K +L+ +
Subjt:  MSGALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKVRNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSL

Query:  IVQNQSAFIPGRCVVDNV-----------------------------------IVGCISSIRFSFNFNG--------------IRCGDVKP---SQGLRQ
        I  +QS  +PGR + DNV                                   ++G + +  F   F G              I      P    +G+RQ
Subjt:  IVQNQSAFIPGRCVVDNV-----------------------------------IVGCISSIRFSFNFNG--------------IRCGDVKP---SQGLRQ

Query:  GDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKST
        G PLS  L+ L +E    +L      + ++GL +      +    +ADD +L  +  V+  +  Q   + Y  AS   +N+ KS+
Subjt:  GDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYEKASDQVVNFDKST

P92555 Uncharacterized mitochondrial protein AtMg012504.5e-1452.17Show/hide
Query:  FNFNGIRCGDVKPSQGLRQGDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDS
        F  NG   G V PS+GLRQGDPLSPYLF+LC E LS +   A+    + G+R+S   P ++HL FADD+
Subjt:  FNFNGIRCGDVKPSQGLRQGDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDS

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein8.5e-0833.72Show/hide
Query:  MSGALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKVRNPRRVSEFRLISLCNVAYKLIS
        ++ A+  +  NKAPGPD     F+ + W VV +  +                N T I LIPKV    ++S FR +S C V YK+I+
Subjt:  MSGALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKVRNPRRVSEFRLISLCNVAYKLIS

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)3.2e-1552.17Show/hide
Query:  FNFNGIRCGDVKPSQGLRQGDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDS
        F  NG   G V PS+GLRQGDPLSPYLF+LC E LS +   A+    + G+R+S   P ++HL FADD+
Subjt:  FNFNGIRCGDVKPSQGLRQGDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGGGGCATTGGCTCAGATTCACCCGAATAAAGCCCCTGGCCCAGATGGGATGTTAGGGGCCTTTTATCGTCAGGAATGGGAGGTAGTTGGGGAGGATGTGATGAG
GTGTTGTCTGGGGGTCCTGAATAATAAAGAGTCGCCTGGGGGTCTAAACGAGACGATGATAGTGTTGATCCCGAAGGTGAGAAATCCTAGACGGGTGTCTGAGTTTCGTC
TTATCTCCTTGTGTAATGTGGCTTATAAGTTGATTTCGAAAGTGTTAGTGAACAAGATGAAAGGGTTGTTGTCCAGCTTGATAGTCCAGAACCAGAGTGCTTTCATTCCA
GGGCGGTGTGTGGTAGATAATGTCATTGTAGGGTGTATCTCGTCAATCAGGTTTTCCTTTAATTTCAATGGAATTCGTTGCGGGGATGTGAAACCGAGTCAGGGTCTACG
ACAAGGGGACCCCCTGTCGCCGTATCTATTTTTGTTATGTGTCGAAGGTTTATCTGCCATGCTTCATGATGCAGAGGCTTCCAGGTCCATATCAGGTCTGAGGATATCGA
GACGGGGCCCAGCGCTGTCACACCTTTTCTTTGCAGATGACAGCCTTTTGTTCTTCCGAGCTAGAGTGGAGGAAGCGAAGGTCATTCAGAATATCCTGCAGTGCTATGAG
AAGGCGTCCGATCAGGTGGTGAATTTTGATAAGTCCACAGTAGCTTTTAGTCCGAATACTGATGATCAGACTCAAGGGAGTGTGAATGGGGTTCTCCACGTGCGATCTAC
TCCCTGCCATCAACATTATTTGGGACTCCCTACCTTTATGCCTCGTAGTAGAGTGAGCTCTATGAAGTTCATTAAGGATAGAGTATGGAGGAAGATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCAGGGGCATTGGCTCAGATTCACCCGAATAAAGCCCCTGGCCCAGATGGGATGTTAGGGGCCTTTTATCGTCAGGAATGGGAGGTAGTTGGGGAGGATGTGATGAG
GTGTTGTCTGGGGGTCCTGAATAATAAAGAGTCGCCTGGGGGTCTAAACGAGACGATGATAGTGTTGATCCCGAAGGTGAGAAATCCTAGACGGGTGTCTGAGTTTCGTC
TTATCTCCTTGTGTAATGTGGCTTATAAGTTGATTTCGAAAGTGTTAGTGAACAAGATGAAAGGGTTGTTGTCCAGCTTGATAGTCCAGAACCAGAGTGCTTTCATTCCA
GGGCGGTGTGTGGTAGATAATGTCATTGTAGGGTGTATCTCGTCAATCAGGTTTTCCTTTAATTTCAATGGAATTCGTTGCGGGGATGTGAAACCGAGTCAGGGTCTACG
ACAAGGGGACCCCCTGTCGCCGTATCTATTTTTGTTATGTGTCGAAGGTTTATCTGCCATGCTTCATGATGCAGAGGCTTCCAGGTCCATATCAGGTCTGAGGATATCGA
GACGGGGCCCAGCGCTGTCACACCTTTTCTTTGCAGATGACAGCCTTTTGTTCTTCCGAGCTAGAGTGGAGGAAGCGAAGGTCATTCAGAATATCCTGCAGTGCTATGAG
AAGGCGTCCGATCAGGTGGTGAATTTTGATAAGTCCACAGTAGCTTTTAGTCCGAATACTGATGATCAGACTCAAGGGAGTGTGAATGGGGTTCTCCACGTGCGATCTAC
TCCCTGCCATCAACATTATTTGGGACTCCCTACCTTTATGCCTCGTAGTAGAGTGAGCTCTATGAAGTTCATTAAGGATAGAGTATGGAGGAAGATTTAG
Protein sequenceShow/hide protein sequence
MSGALAQIHPNKAPGPDGMLGAFYRQEWEVVGEDVMRCCLGVLNNKESPGGLNETMIVLIPKVRNPRRVSEFRLISLCNVAYKLISKVLVNKMKGLLSSLIVQNQSAFIP
GRCVVDNVIVGCISSIRFSFNFNGIRCGDVKPSQGLRQGDPLSPYLFLLCVEGLSAMLHDAEASRSISGLRISRRGPALSHLFFADDSLLFFRARVEEAKVIQNILQCYE
KASDQVVNFDKSTVAFSPNTDDQTQGSVNGVLHVRSTPCHQHYLGLPTFMPRSRVSSMKFIKDRVWRKI