; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10014529 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10014529
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr02:13619900..13621857
RNA-Seq ExpressionHG10014529
SyntenyHG10014529
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]9.9e-2926.22Show/hide
Query:  GLKLRSGNGETIRIYQDPWIPRSYTFKPLSYNPAFKEAKVANFIDQDGKWILEKLEEAFIQEDINCIKRIPI-------------NRN------------
        GL+ R GNG +I++Y D W+P    FK +S         V +     G+W +  L++ F  ++++   +IP+              RN            
Subjt:  GLKLRSGNGETIRIYQDPWIPRSYTFKPLSYNPAFKEAKVANFIDQDGKWILEKLEEAFIQEDINCIKRIPI-------------NRN------------

Query:  --TKDR--------------------------KIKFFCWRAINDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGCSRAKLIWKNIVHSPLLSQDF
           KD+                          KIKFF WR   D +P    L  R I    IC  C    ES+ H++  C  AK +W+N     +  +  
Subjt:  --TKDR--------------------------KIKFFCWRAINDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGCSRAKLIWKNIVHSPLLSQDF

Query:  NNSFRDRWIALSSSRLTKELNLITITCWAILNDRNNAGLGKPIPDHIIKCDWILKYYEEFLKSSSRNKRVSLFKKRLSLLVEVIG--RVPLGFFKLNVDA
         NSFR+ W AL  S   +E  L    CW + N RN+                + K  +EF  S++ N   ++  ++ S    + G    P G +K+NVD 
Subjt:  NNSFRDRWIALSSSRLTKELNLITITCWAILNDRNNAGLGKPIPDHIIKCDWILKYYEEFLKSSSRNKRVSLFKKRLSLLVEVIG--RVPLGFFKLNVDA

Query:  AWKSNPTSTGLGAIIRDSNGQLKSVVARSLDLDFDPSLAEILAIREGIRFAAAQGCSNLVVESDCAQAINLLNDDEAEFRIANYWIDEI
        A KS  +  G+G ++R++NG+  +   R +   +     E++A  EG+RFA   G +  V+E D    IN +   E    I    I+E+
Subjt:  AWKSNPTSTGLGAIIRDSNGQLKSVVARSLDLDFDPSLAEILAIREGIRFAAAQGCSNLVVESDCAQAINLLNDDEAEFRIANYWIDEI

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]1.3e-3629Show/hide
Query:  GLKLRSGNGETIRIYQDPWIPRSYTFKPLSYNPAFKEAKVANFIDQDGKWILEKLEEAFIQEDINCIKRI------------------------------
        G++ R G+G+ + +Y+D WIPR  TF+P+S      E  VA+ ID + KW +++LE+ F++EDI  I +I                              
Subjt:  GLKLRSGNGETIRIYQDPWIPRSYTFKPLSYNPAFKEAKVANFIDQDGKWILEKLEEAFIQEDINCIKRI------------------------------

Query:  --------PINRNTKDR------------KIKFFCWRAINDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGCSRAKLIWKNIVHSPLL-------
                P + N+  R            K+K F WRA+ +I+P   NL KR      IC  CK   E++ H LI C  A+ IW     +PL+       
Subjt:  --------PINRNTKDR------------KIKFFCWRAINDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGCSRAKLIWKNIVHSPLL-------

Query:  SQDFNNSFRDRWIALSSSRLTKELNLITITCWAILNDRNNAGL-GKPIPDHII--KCDWILKYYEEFLKSSSRNKRVSLFKKRLSLLVEVIGRVPLGFFK
        +QDF ++ ++ W    S   T E  L+ + CW I + RN     GK      +  K D +LK Y+   K  + +       K   +  +          K
Subjt:  SQDFNNSFRDRWIALSSSRLTKELNLITITCWAILNDRNNAGL-GKPIPDHII--KCDWILKYYEEFLKSSSRNKRVSLFKKRLSLLVEVIGRVPLGFFK

Query:  LNVDAAWKSNPTSTGLGAIIRDSNGQLKSVVARSLDLDFDPSLAEILAIREGIRFAAAQGCSNLVVESDCAQAINLLNDDEAEFRIANYWIDEIRNNAKK
        LNVDAA  +     GLGAI+RD+ G++ +V  +        SLAE  AI  G++ A     S+L+VESDC + + LLN+ +      ++ + ++R  +K+
Subjt:  LNVDAAWKSNPTSTGLGAIIRDSNGQLKSVVARSLDLDFDPSLAEILAIREGIRFAAAQGCSNLVVESDCAQAINLLNDDEAEFRIANYWIDEIRNNAKK

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]1.9e-4332.22Show/hide
Query:  GLKLRSGNGETIRIYQDPWIPRSYTFKPLSYNPAFKEAKVANFIDQDGKWILEKLEEAFIQEDINCIKRIPIN---------------------------
        GL+LR GNG TI+ + DPW+PR  TFKPL +N    +  VA+FI  DG W +  +  +F  ED + I  +PI+                           
Subjt:  GLKLRSGNGETIRIYQDPWIPRSYTFKPLSYNPAFKEAKVANFIDQDGKWILEKLEEAFIQEDINCIKRIPIN---------------------------

Query:  -----------------------RNTKDRKIKFFCWRAINDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGCSRAKLIWKNIV-HSPLLSQDFNN
                               + T   KIK F WR+ ++ IP   NLL RGI     C++C    ESI H+   C RA+ IW+ +      LS + N 
Subjt:  -----------------------RNTKDRKIKFFCWRAINDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGCSRAKLIWKNIV-HSPLLSQDFNN

Query:  SFRDRWIALSSSRLTKELNLITITCWAILNDRNNAGLGKPIPDHIIKCDWILKYYEEFLKS--SSRNKRVSLFKKRLSLLVEVIGRVPLGFFKLNVDAAW
        SF + W +L+     K+LNL  IT W I NDRN+   GK +     KC+W+  + +   ++  S+ + R     + +         V L   KLN DAA 
Subjt:  SFRDRWIALSSSRLTKELNLITITCWAILNDRNNAGLGKPIPDHIIKCDWILKYYEEFLKS--SSRNKRVSLFKKRLSLLVEVIGRVPLGFFKLNVDAAW

Query:  KSNPTSTGLGAIIRDSNGQLKSVVARSLDLDFDPSLAEILAIREGIRFAAAQGCSNLVVESDCAQAINLLNDDEAEFRIANYWIDEIR
        +    ST  G IIRDS+  L +  +  +     P LAEI  I EG++FAAA   ++L VESD   AI L+ ++         W+ EI+
Subjt:  KSNPTSTGLGAIIRDSNGQLKSVVARSLDLDFDPSLAEILAIREGIRFAAAQGCSNLVVESDCAQAINLLNDDEAEFRIANYWIDEIR

XP_024950112.1 uncharacterized protein LOC112496847 [Citrus sinensis]2.1e-3128.1Show/hide
Query:  GLKLRSGNGETIRIYQDPWIPRSYTFKPLSYNPAFKEAKVANFIDQDGKWILEKLEEAFIQEDINCIKRIPI-----------------NRNTKD-----
        G++ R GNG+ I I+ D W+PR  TF+P+        + VA+ I  D +W   KL + F+  D   I +IP+                 N + K      
Subjt:  GLKLRSGNGETIRIYQDPWIPRSYTFKPLSYNPAFKEAKVANFIDQDGKWILEKLEEAFIQEDINCIKRIPI-----------------NRNTKD-----

Query:  ----------------------------RKIKFFCWRAINDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGCSRAKLIWKNIVHSPLLSQDFNNS
                                     K+K F WRA N+++P+  NL KR +     C  CK S E+I H+L+ C  A+ IW   + SP  +     +
Subjt:  ----------------------------RKIKFFCWRAINDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGCSRAKLIWKNIVHSPLLSQDFNNS

Query:  FRDRWIALS--SSRLTK-ELNLITITCWAILNDRNNAGLGKPIPDHII---KCDWILKYYEEFLKSSSRNKRVSLFKKRLSLLVEVIGRVPLGFFKLNVD
         +D +  L   +  L K +L L+   CW+    RN         + II   K + +L  ++   K    +  +S+ +K+   L       P   FK+NVD
Subjt:  FRDRWIALS--SSRLTK-ELNLITITCWAILNDRNNAGLGKPIPDHII---KCDWILKYYEEFLKSSSRNKRVSLFKKRLSLLVEVIGRVPLGFFKLNVD

Query:  AAWKSNPTSTGLGAIIRDSNGQLKSVVARSLDLDFDPSLAEILAIREGIRFAAAQGCSNLVVESDCAQAINLLNDDEAEFRIANYWIDEIRNNAK
        AA+ S   S G+GA+IRDSNG++ +       L    SLAE  A+  G++ A     S+L++ESDC + + L+N+ +       + I  I+N  K
Subjt:  AAWKSNPTSTGLGAIIRDSNGQLKSVVARSLDLDFDPSLAEILAIREGIRFAAAQGCSNLVVESDCAQAINLLNDDEAEFRIANYWIDEIRNNAK

XP_030926547.1 uncharacterized protein LOC115953156 [Quercus lobata]1.5e-2927.82Show/hide
Query:  GLKLRSGNGETIRIYQDPWIPRSYTFKPLSYNPAFKE-AKVANFIDQDGK-WILEKLEEAFIQEDINCIKRIPINRNTK---------------------
        G + R GNG+ I I+ D W+P   T+K +S      E   V++ I+   K W ++ +   F+  +   I +IP++R                        
Subjt:  GLKLRSGNGETIRIYQDPWIPRSYTFKPLSYNPAFKE-AKVANFIDQDGK-WILEKLEEAFIQEDINCIKRIPINRNTK---------------------

Query:  --------------------------------DRKIKFFCWRAINDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGCSRAKLIWKNIVHSPLLSQ
                                          KIK F WRA  + +P    + +RGI+ N+ C +C    ES+DH+L+ C  + L+W   + +PL +Q
Subjt:  --------------------------------DRKIKFFCWRAINDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGCSRAKLIWKNIVHSPLLSQ

Query:  DFNNSFRDRWIALSSSRLTKELNLITITCWAILNDRNN-----AGLGKPIPDHIIKCDW--ILKYYEEFLKSSSRNKRVSLFKKRLSLLVEVIGRVPLGF
         F NSF D  + + S    ++L +   T WAI ++RNN      GL    P H+    W       EEF  S+S +    L +   S         P G 
Subjt:  DFNNSFRDRWIALSSSRLTKELNLITITCWAILNDRNN-----AGLGKPIPDHIIKCDW--ILKYYEEFLKSSSRNKRVSLFKKRLSLLVEVIGRVPLGF

Query:  FKLNVDAAWKSNPTSTGLGAIIRDSNGQLKSVVARSLDLDFDPSLAEILAIREGIRFAAAQGCSNLVVESDCAQAINLLND
        FK+NVD A      S+ +G +IRDSNGQ+ + +   L   F   L+E+ A+ +G+ FA       ++VESD    I  +ND
Subjt:  FKLNVDAAWKSNPTSTGLGAIIRDSNGQLKSVVARSLDLDFDPSLAEILAIREGIRFAAAQGCSNLVVESDCAQAINLLND

TrEMBL top hitse value%identityAlignment
A0A5E4FZN9 PREDICTED: retrotransposon4.8e-2926.22Show/hide
Query:  GLKLRSGNGETIRIYQDPWIPRSYTFKPLSYNPAFKEAKVANFIDQDGKWILEKLEEAFIQEDINCIKRIPI-------------NRN------------
        GL+ R GNG +I++Y D W+P    FK +S         V +     G+W +  L++ F  ++++   +IP+              RN            
Subjt:  GLKLRSGNGETIRIYQDPWIPRSYTFKPLSYNPAFKEAKVANFIDQDGKWILEKLEEAFIQEDINCIKRIPI-------------NRN------------

Query:  --TKDR--------------------------KIKFFCWRAINDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGCSRAKLIWKNIVHSPLLSQDF
           KD+                          KIKFF WR   D +P    L  R I    IC  C    ES+ H++  C  AK +W+N     +  +  
Subjt:  --TKDR--------------------------KIKFFCWRAINDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGCSRAKLIWKNIVHSPLLSQDF

Query:  NNSFRDRWIALSSSRLTKELNLITITCWAILNDRNNAGLGKPIPDHIIKCDWILKYYEEFLKSSSRNKRVSLFKKRLSLLVEVIG--RVPLGFFKLNVDA
         NSFR+ W AL  S   +E  L    CW + N RN+                + K  +EF  S++ N   ++  ++ S    + G    P G +K+NVD 
Subjt:  NNSFRDRWIALSSSRLTKELNLITITCWAILNDRNNAGLGKPIPDHIIKCDWILKYYEEFLKSSSRNKRVSLFKKRLSLLVEVIG--RVPLGFFKLNVDA

Query:  AWKSNPTSTGLGAIIRDSNGQLKSVVARSLDLDFDPSLAEILAIREGIRFAAAQGCSNLVVESDCAQAINLLNDDEAEFRIANYWIDEI
        A KS  +  G+G ++R++NG+  +   R +   +     E++A  EG+RFA   G +  V+E D    IN +   E    I    I+E+
Subjt:  AWKSNPTSTGLGAIIRDSNGQLKSVVARSLDLDFDPSLAEILAIREGIRFAAAQGCSNLVVESDCAQAINLLNDDEAEFRIANYWIDEI

A0A6J1DX30 uncharacterized protein LOC1110248749.0e-4432.22Show/hide
Query:  GLKLRSGNGETIRIYQDPWIPRSYTFKPLSYNPAFKEAKVANFIDQDGKWILEKLEEAFIQEDINCIKRIPIN---------------------------
        GL+LR GNG TI+ + DPW+PR  TFKPL +N    +  VA+FI  DG W +  +  +F  ED + I  +PI+                           
Subjt:  GLKLRSGNGETIRIYQDPWIPRSYTFKPLSYNPAFKEAKVANFIDQDGKWILEKLEEAFIQEDINCIKRIPIN---------------------------

Query:  -----------------------RNTKDRKIKFFCWRAINDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGCSRAKLIWKNIV-HSPLLSQDFNN
                               + T   KIK F WR+ ++ IP   NLL RGI     C++C    ESI H+   C RA+ IW+ +      LS + N 
Subjt:  -----------------------RNTKDRKIKFFCWRAINDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGCSRAKLIWKNIV-HSPLLSQDFNN

Query:  SFRDRWIALSSSRLTKELNLITITCWAILNDRNNAGLGKPIPDHIIKCDWILKYYEEFLKS--SSRNKRVSLFKKRLSLLVEVIGRVPLGFFKLNVDAAW
        SF + W +L+     K+LNL  IT W I NDRN+   GK +     KC+W+  + +   ++  S+ + R     + +         V L   KLN DAA 
Subjt:  SFRDRWIALSSSRLTKELNLITITCWAILNDRNNAGLGKPIPDHIIKCDWILKYYEEFLKS--SSRNKRVSLFKKRLSLLVEVIGRVPLGFFKLNVDAAW

Query:  KSNPTSTGLGAIIRDSNGQLKSVVARSLDLDFDPSLAEILAIREGIRFAAAQGCSNLVVESDCAQAINLLNDDEAEFRIANYWIDEIR
        +    ST  G IIRDS+  L +  +  +     P LAEI  I EG++FAAA   ++L VESD   AI L+ ++         W+ EI+
Subjt:  KSNPTSTGLGAIIRDSNGQLKSVVARSLDLDFDPSLAEILAIREGIRFAAAQGCSNLVVESDCAQAINLLNDDEAEFRIANYWIDEIR

A0A803QH54 Uncharacterized protein4.1e-2825.06Show/hide
Query:  GLKLRSGNGETIRIYQDPWI--PRSYTFKPLSYNPAFKEAKVANFIDQDGKWILEKLEEAFIQEDINCIKRIPINRNTKD--------------------
        GL  + GNG+TIR  QD WI  PRS  FK  S  P+    KV+ FI+ +G W L +L E F  + + CI ++PI     D                    
Subjt:  GLKLRSGNGETIRIYQDPWI--PRSYTFKPLSYNPAFKEAKVANFIDQDGKWILEKLEEAFIQEDINCIKRIPINRNTKD--------------------

Query:  ------------------------------RKIKFFCWRAINDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGCSRAKLIWK--NIVHSPLLSQD
                                       K+K F WR    I+P   NL +R +  +  CS C  S E++ H+L+ CSRA+ +WK   + H  +L + 
Subjt:  ------------------------------RKIKFFCWRAINDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGCSRAKLIWK--NIVHSPLLSQD

Query:  FNNSFRDRWIALSSSRLTKELNLITITCWAILNDRNNAGLGKPIPDHIIKCDWILKYYEEFLKSSSRNKRVSLFKKRLSLLVEVIGRVPLGFFKLNVDAA
         ++  +D  ++  +   T + +L+  T W+I   RN         +      WI  +  ++    ++ KR+ +     + +     +V  G ++L  DAA
Subjt:  FNNSFRDRWIALSSSRLTKELNLITITCWAILNDRNNAGLGKPIPDHIIKCDWILKYYEEFLKSSSRNKRVSLFKKRLSLLVEVIGRVPLGFFKLNVDAA

Query:  WKSNPTSTGLGAIIRDSNGQLKSVVARSLDLDFDPSLAEILAIREGIRFAAAQGCSNLVVESDCAQAINLLNDDEAEFRIANYWIDEIRNN
         ++     GLGA+++D NGQ+ + ++  +  +  P+LAE  A+R  + + ++      ++ SDC Q +  ++  + +       + +IRN+
Subjt:  WKSNPTSTGLGAIIRDSNGQLKSVVARSLDLDFDPSLAEILAIREGIRFAAAQGCSNLVVESDCAQAINLLNDDEAEFRIANYWIDEIRNN

A0A803QH76 Uncharacterized protein2.5e-3026.51Show/hide
Query:  GLKLRSGNGETIRIYQDPWIPRSYTFKPLSY--NPAFKEAKVANFIDQDGKWILEKLEEAFIQEDINCIKRIPINRNTKD--------------------
        GL+ + GNG+T+R  +DPW+P + TF P  Y  +P F    V ++I+ + +W ++ L++ F   D+  I  IP++   KD                    
Subjt:  GLKLRSGNGETIRIYQDPWIPRSYTFKPLSY--NPAFKEAKVANFIDQDGKWILEKLEEAFIQEDINCIKRIPINRNTKD--------------------

Query:  ------------------------------RKIKFFCWRAINDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGCSRAKLIWKNIVHSPLLSQDFN
                                      +K+K F WR +ND +P  +NL  R I  +  C+LCK S ES+ H+L  C RAKL+W     +  +    N
Subjt:  ------------------------------RKIKFFCWRAINDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGCSRAKLIWKNIVHSPLLSQDFN

Query:  NSFRDRWIALSSSRLTKELNLITITCWAILNDRNNAGLG-KPIPDHIIKCDWILKYYEEFLKSSSRNKRVSLFKK---RLSLLVEVIGRV-------PLG
            D +  L+++    +L +IT   W I ++RN    G KP P  I+ C +   Y  ++  ++++ K  S         S+ V+   +          G
Subjt:  NSFRDRWIALSSSRLTKELNLITITCWAILNDRNNAGLG-KPIPDHIIKCDWILKYYEEFLKSSSRNKRVSLFKK---RLSLLVEVIGRV-------PLG

Query:  FFKLNVDAAWKSNPTSTGLGAIIRDSNGQLKSVVARSLDLDFDPSLAEILAIREGIRFAAAQGCSNLVVESDCAQAINLLN
         +KLNVDAA        G GAI+RD +G + + +++     F P   E+ A+   +++A         +E+D    +N LN
Subjt:  FFKLNVDAAWKSNPTSTGLGAIIRDSNGQLKSVVARSLDLDFDPSLAEILAIREGIRFAAAQGCSNLVVESDCAQAINLLN

A2YJD5 Reverse transcriptase domain-containing protein3.1e-2826.68Show/hide
Query:  GLKLRSGNGETIRIYQDPWIPRSYTFKPLSYNPAFKEAKVANFIDQDGKWILEKLEEAFIQEDINCIKRIPINRNTKDR------------KIKFFCWRA
        G+  R GNG +IRI++DPWIPR+++ K +S   + +   V++ +D DG W   ++   F+  D   I  I  +R   +              +K F W+A
Subjt:  GLKLRSGNGETIRIYQDPWIPRSYTFKPLSYNPAFKEAKVANFIDQDGKWILEKLEEAFIQEDINCIKRIPINRNTKDR------------KIKFFCWRA

Query:  INDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGCSRAKLIWKNIVHSPLLSQDFNNSFRDRWIALSSSRLTKELNLIT--------ITCWAILND
        I + +   LN  KR +  +  C++C    E + H+L  C +AK +W  + +    S D       RW    S  +  +L ++         +  W   + 
Subjt:  INDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGCSRAKLIWKNIVHSPLLSQDFNNSFRDRWIALSSSRLTKELNLIT--------ITCWAILND

Query:  RNNAGLGKPIPDHIIKCDWILKYYEEFLKSSSRNKR-VSLFKKRLSLLVEVIGR--------------VPLGFFKLNVDAAWKSNPTSTGLGAIIRDSNG
        RN    GK  P       +I+ Y+   L+     K  ++  K  +  +V   GR                 G+ KLNVD +++ +  S G+GAI+R+S G
Subjt:  RNNAGLGKPIPDHIIKCDWILKYYEEFLKSSSRNKR-VSLFKKRLSLLVEVIGR--------------VPLGFFKLNVDAAWKSNPTSTGLGAIIRDSNG

Query:  QLKSVVARSLDLDFDPSLAEILAIREGIRFAAAQGCSNLVVESDCAQAINLLNDDEAEFRIANYWIDEIRN
        ++      S++       AE+LA R+G+          +V+ESDC QAI L+   E E     + I EI++
Subjt:  QLKSVVARSLDLDFDPSLAEILAIREGIRFAAAQGCSNLVVESDCAQAINLLNDDEAEFRIANYWIDEIRN

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657507.2e-0619.1Show/hide
Query:  QDGKWILEKLEEAFIQEDI---------NCIKRIPINRNTKDRKIKFFCWRAINDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGCSRAKLIWKN
        QDG++ +    E    +++         NC+ ++ +       ++K F W   N  +       +R ++ + +C +CK   ES+ H L  C     IW  
Subjt:  QDGKWILEKLEEAFIQEDI---------NCIKRIPINRNTKDRKIKFFCWRAINDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGCSRAKLIWKN

Query:  IVHSPLLSQDFNNSFRDRWIALSSSRLTKELNLITITCWAIL-------NDRNNAGLGKPIPDHIIKCDWILKYYEEFLKSSSRNKRVSLFKKRLSLLVE
        +V        F+ S  + W+  +    +   ++   T +A++          N  G      D +    ++ ++  E  ++ S N  V + + R+  ++ 
Subjt:  IVHSPLLSQDFNNSFRDRWIALSSSRLTKELNLITITCWAIL-------NDRNNAGLGKPIPDHIIKCDWILKYYEEFLKSSSRNKRVSLFKKRLSLLVE

Query:  VIGRVPLGFFKLNVDAAWKSNPTSTGLGAIIRDSNGQLKSVVARSLDLDFDPSLAEILAIREGIRFAAAQGCSNLVVESDCAQAINLL
         +    +G+ K+N D A + NP     G ++RD  G      + ++     P  AE+  +  G+ FA  +    + +E D    +  L
Subjt:  VIGRVPLGFFKLNVDAAWKSNPTSTGLGAIIRDSNGQLKSVVARSLDLDFDPSLAEILAIREGIRFAAAQGCSNLVVESDCAQAINLL

Arabidopsis top hitse value%identityAlignment
AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.1e-0434.43Show/hide
Query:  KIKFFCWRAINDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGC--SRAKLIWKNIV
        KIK   W+A+N+ +P    LL R I++   C+ C+   E+I H L  C  ++ ++I K+I+
Subjt:  KIKFFCWRAINDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGC--SRAKLIWKNIV

AT4G29090.1 Ribonuclease H-like superfamily protein8.4e-1021.6Show/hide
Query:  KIKFFCWRAINDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGCSRAKLIWK-NIVHSPLLSQDFNNSFRDRW----IALSSSRLTKELNLITITC
        KI+ F W+ +++ +P    L  R ++    C  C +  E+++H L  C+ A+L W  + +  PL  +  ++ + + +    +   + +  K   L+    
Subjt:  KIKFFCWRAINDIIPANLNLLKRGINLNLICSLCKASPESIDHSLIGCSRAKLIWK-NIVHSPLLSQDFNNSFRDRW----IALSSSRLTKELNLITITC

Query:  WAILNDRNNAGLGKPIPDHIIKCDWILKYYEEFLKSSSRNKRVSLFKKRLSLLVEVIGR---VPLGFFKLNVDAAWKSNPTSTGLGAIIRDSNGQLKSVV
        W +  +RN                 +L+  E+ L+             +  +     GR    P  + K N DA W  +    G+G ++R+  G++K + 
Subjt:  WAILNDRNNAGLGKPIPDHIIKCDWILKYYEEFLKSSSRNKRVSLFKKRLSLLVEVIGR---VPLGFFKLNVDAAWKSNPTSTGLGAIIRDSNGQLKSVV

Query:  ARSLDLDFDPSLAEILAIREGIRFAAAQGCSNLVVESDCAQAINLLNDDE
        AR+L        AE+ A+R  +   +    + ++ ESD    I +LN+DE
Subjt:  ARSLDLDFDPSLAEILAIREGIRFAAAQGCSNLVVESDCAQAINLLNDDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTGGTTTGAAACTACGTTCTGGTAATGGGGAAACAATAAGAATCTATCAAGATCCCTGGATACCTAGAAGCTACACTTTCAAGCCTTTATCTTACAACCCTGCATT
TAAGGAAGCCAAAGTTGCAAACTTCATTGATCAGGATGGGAAGTGGATTTTGGAAAAACTGGAAGAGGCCTTCATTCAAGAGGATATTAATTGCATCAAAAGAATCCCAA
TTAACAGGAATACTAAAGACAGGAAAATAAAATTTTTCTGTTGGAGAGCTATTAATGACATCATCCCAGCCAATCTGAATCTTTTGAAAAGAGGTATAAATTTAAATCTC
ATTTGTTCCTTATGCAAGGCCTCACCTGAATCAATTGATCATTCCCTAATTGGATGCTCCCGAGCAAAATTGATCTGGAAAAACATTGTTCATTCGCCACTCCTTAGCCA
GGACTTCAACAACAGTTTTCGTGATAGATGGATTGCTTTAAGCTCATCCAGACTGACTAAGGAGCTAAACCTTATTACAATCACTTGCTGGGCCATCTTGAACGACAGAA
ACAACGCAGGCCTGGGAAAGCCAATTCCAGACCATATTATAAAATGCGATTGGATCTTAAAGTATTATGAAGAGTTCTTAAAATCCTCCTCAAGAAATAAAAGAGTCTCT
TTGTTTAAAAAAAGATTGTCCCTACTTGTCGAAGTAATTGGAAGAGTCCCCCTGGGTTTCTTTAAACTGAATGTTGATGCGGCGTGGAAATCTAACCCGACTTCAACAGG
TTTGGGTGCGATCATCAGAGATTCAAATGGACAGCTTAAAAGTGTAGTCGCTCGTTCCCTTGATCTGGACTTTGATCCTTCTTTAGCTGAAATCCTTGCCATTCGTGAAG
GCATCCGCTTTGCTGCTGCTCAAGGTTGCTCTAATCTGGTTGTGGAATCAGACTGTGCCCAAGCAATTAACTTACTTAATGATGATGAAGCTGAGTTCAGAATTGCAAAC
TACTGGATTGATGAAATCAGGAACAATGCAAAAAAAAAATTCTTCTATTTCCTTCATTTTTTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTTGGTTTGAAACTACGTTCTGGTAATGGGGAAACAATAAGAATCTATCAAGATCCCTGGATACCTAGAAGCTACACTTTCAAGCCTTTATCTTACAACCCTGCATT
TAAGGAAGCCAAAGTTGCAAACTTCATTGATCAGGATGGGAAGTGGATTTTGGAAAAACTGGAAGAGGCCTTCATTCAAGAGGATATTAATTGCATCAAAAGAATCCCAA
TTAACAGGAATACTAAAGACAGGAAAATAAAATTTTTCTGTTGGAGAGCTATTAATGACATCATCCCAGCCAATCTGAATCTTTTGAAAAGAGGTATAAATTTAAATCTC
ATTTGTTCCTTATGCAAGGCCTCACCTGAATCAATTGATCATTCCCTAATTGGATGCTCCCGAGCAAAATTGATCTGGAAAAACATTGTTCATTCGCCACTCCTTAGCCA
GGACTTCAACAACAGTTTTCGTGATAGATGGATTGCTTTAAGCTCATCCAGACTGACTAAGGAGCTAAACCTTATTACAATCACTTGCTGGGCCATCTTGAACGACAGAA
ACAACGCAGGCCTGGGAAAGCCAATTCCAGACCATATTATAAAATGCGATTGGATCTTAAAGTATTATGAAGAGTTCTTAAAATCCTCCTCAAGAAATAAAAGAGTCTCT
TTGTTTAAAAAAAGATTGTCCCTACTTGTCGAAGTAATTGGAAGAGTCCCCCTGGGTTTCTTTAAACTGAATGTTGATGCGGCGTGGAAATCTAACCCGACTTCAACAGG
TTTGGGTGCGATCATCAGAGATTCAAATGGACAGCTTAAAAGTGTAGTCGCTCGTTCCCTTGATCTGGACTTTGATCCTTCTTTAGCTGAAATCCTTGCCATTCGTGAAG
GCATCCGCTTTGCTGCTGCTCAAGGTTGCTCTAATCTGGTTGTGGAATCAGACTGTGCCCAAGCAATTAACTTACTTAATGATGATGAAGCTGAGTTCAGAATTGCAAAC
TACTGGATTGATGAAATCAGGAACAATGCAAAAAAAAAATTCTTCTATTTCCTTCATTTTTTGTAA
Protein sequenceShow/hide protein sequence
MLGLKLRSGNGETIRIYQDPWIPRSYTFKPLSYNPAFKEAKVANFIDQDGKWILEKLEEAFIQEDINCIKRIPINRNTKDRKIKFFCWRAINDIIPANLNLLKRGINLNL
ICSLCKASPESIDHSLIGCSRAKLIWKNIVHSPLLSQDFNNSFRDRWIALSSSRLTKELNLITITCWAILNDRNNAGLGKPIPDHIIKCDWILKYYEEFLKSSSRNKRVS
LFKKRLSLLVEVIGRVPLGFFKLNVDAAWKSNPTSTGLGAIIRDSNGQLKSVVARSLDLDFDPSLAEILAIREGIRFAAAQGCSNLVVESDCAQAINLLNDDEAEFRIAN
YWIDEIRNNAKKKFFYFLHFL