; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10007862 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10007862
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr10:16121942..16125208
RNA-Seq ExpressionHG10007862
SyntenyHG10007862
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
OMO69928.1 reverse transcriptase [Corchorus capsularis]6.7e-3751.27Show/hide
Query:  HAVTNEMNDKLVARYSKEEVERPINQMFLTKAPGPDGFQQSFINIIGELWE---VGDYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAI
        H VT EMN  L++ ++ EE+   +NQM  TKAPGPD   Q+ I +I ++ E   +  +RPISLCNV+YKI++K + NRLK+     ISENQSAF+P R I
Subjt:  HAVTNEMNDKLVARYSKEEVERPINQMFLTKAPGPDGFQQSFINIIGELWE---VGDYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAI

Query:  IDNIMIGHECIHYLKNKRIGKDGLVAVKLDLSKAYNRVEWSFLIAIMARLGFDLRVIS
         DNI++ +E +H L++ + GK G  A+KLD+SKAY+RVEW FL A+M RLGFD R ++
Subjt:  IDNIMIGHECIHYLKNKRIGKDGLVAVKLDLSKAYNRVEWSFLIAIMARLGFDLRVIS

XP_016694556.2 uncharacterized protein LOC107911187 [Gossypium hirsutum]1.6e-3835.97Show/hide
Query:  IEVRFTSG-SNSFW-ELFNSVSILHLDWLFSNHHSIMKVLSLGRSNGKRKNWPFKFEEFWTRDDECKSIIEKAGLWESNHCQF-----SLKERLNRCSKS
        I+ R   G +N+ W  LF  V + HL   FSNH  ++  L   +   +  N  F+FE +W  ++    I E   LWE++  +       LKE L   ++ 
Subjt:  IEVRFTSG-SNSFW-ELFNSVSILHLDWLFSNHHSIMKVLSLGRSNGKRKNWPFKFEEFWTRDDECKSIIEKAGLWESNHCQF-----SLKERLNRCSKS

Query:  LKVSRRGKNKNLKSR----------------IMECKLALQAAYDN----PHAVTNEMNDKLVARYSKEEVERPINQMFLTKAPGPDGFQQSFINIIGELW
        ++ +R+   + L  +                +++ K++L    +        +  E N +L A YS+EEV   +  M  TKAPG DGF  +F     + W
Subjt:  LKVSRRGKNKNLKSR----------------IMECKLALQAAYDN----PHAVTNEMNDKLVARYSKEEVERPINQMFLTKAPGPDGFQQSFINIIGELW

Query:  EV--GDYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIGKDGLVAVKLDLSKAYNRVEWSFLIAIMARLG
        ++   D+RPISLCNV+YKI+ K IANRL+L+  + I E QSAF+  R I +N+++ +E +H LK K++GK G +AVKLD+SKAY+RVEW F+  IM ++G
Subjt:  EV--GDYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIGKDGLVAVKLDLSKAYNRVEWSFLIAIMARLG

Query:  FDL
        FDL
Subjt:  FDL

XP_021732105.1 uncharacterized protein LOC110698909 [Chenopodium quinoa]1.5e-3634.44Show/hide
Query:  GSNSFWELFNSVSILHLDWLFSNHHSI-MKVLSLGRSNGKRKNWPFKFEEFWTRDDECKSIIEKAGLWESNHCQFSLKERLNRCSKSLKVSRRGKNKNLK
        GS+ +  +F  V++ HL  + S+H  I +   +  +   +RK   FKFE +W  ++ C+  IE  G WE       +  ++    + L   R   ++NL 
Subjt:  GSNSFWELFNSVSILHLDWLFSNHHSI-MKVLSLGRSNGKRKNWPFKFEEFWTRDDECKSIIEKAGLWESNHCQFSLKERLNRCSKSLKVSRRGKNKNLK

Query:  SRIMECKLALQAAYDNPHAVTN-----EMNDKLVAR----------------YSKEEVERPINQMFLTKAPGPDG----FQQSFINIIG-----------
         +I   + ALQ A + P  V +     E+  K++ R                YS+EEV   +  M  +KAPGPDG    F Q F +I+G           
Subjt:  SRIMECKLALQAAYDNPHAVTN-----EMNDKLVAR----------------YSKEEVERPINQMFLTKAPGPDG----FQQSFINIIG-----------

Query:  ---------------------ELWEVGDYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIGKDGLVAVKL
                             E     D+RPISLCNVIYK++TK I  RLK +   IISENQSAF+P R I DN +I  EC H +K +R G+ G +A+KL
Subjt:  ---------------------ELWEVGDYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIGKDGLVAVKL

Query:  DLSKAYNRVEWSFLIAIMARLGFDLRVISYS
        D+SKAY+RVEW FL  ++ ++GFD R ++ S
Subjt:  DLSKAYNRVEWSFLIAIMARLGFDLRVISYS

XP_041026997.1 uncharacterized protein LOC121267211 [Juglans microcarpa x Juglans regia]5.6e-3632.26Show/hide
Query:  SNSFW-ELFNSVSILHLDWLFSNHHSIMKVLSLGRSNGKRKNWPFKFEEFWTRDDECKSIIEKAGLWESNHCQFS-LKERLN---RCSKSLK--VSRRGK
        +N  W E F  V +  L    S+H  ++    + +   +R+ + FKFE  W ++++C+ +++K  +WE     +  +K+ LN   RCS  L    S+R +
Subjt:  SNSFW-ELFNSVSILHLDWLFSNHHSIMKVLSLGRSNGKRKNWPFKFEEFWTRDDECKSIIEKAGLWESNHCQFS-LKERLN---RCSKSLK--VSRRGK

Query:  NKNLKSRIMECKLALQAAYDNPHA----------------------------------------------------VTNEMNDKLVARYSKEEVERPINQ
         +  K  ++  ++      + P                                                      VT+ MN+ L  R+ +EEVE  +  
Subjt:  NKNLKSRIMECKLALQAAYDNPHA----------------------------------------------------VTNEMNDKLVARYSKEEVERPINQ

Query:  MFLTKAPGPDGFQQSFI-----NIIGEL---WEVGDYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIGK
        M   K+ GPDG+   F       + GEL    +VGD+RPISLCNV+YK++ KT+ANRLK + D ++S+NQSAFIP R I DNI+  +E +H +K +  GK
Subjt:  MFLTKAPGPDGFQQSFI-----NIIGEL---WEVGDYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIGK

Query:  DGLVAVKLDLSKAYNRVEWSFLIAIMARLGFDLRVISYSEG
         G +A+KLD+SKAY+RVEW +L  IM ++GF  R IS   G
Subjt:  DGLVAVKLDLSKAYNRVEWSFLIAIMARLGFDLRVISYSEG

XP_042988698.1 uncharacterized protein LOC122316231 [Carya illinoinensis]2.4e-3925.95Show/hide
Query:  SNHHSIMKVLSLGRSNGKRKNWPFKFEEFWTRDDECKSIIEKAGLWE---SNHCQFSLKERLNRCSKSLKVSRRGKNKNLKSRIMECKLALQA-------
        S+H  I+ V S   +  + +N  F+FE  W  DDEC  +I +   W+    N    S+ +RL+ CS+ L+   + K  ++K +I   + +LQ+       
Subjt:  SNHHSIMKVLSLGRSNGKRKNWPFKFEEFWTRDDECKSIIEKAGLWE---SNHCQFSLKERLNRCSKSLKVSRRGKNKNLKSRIMECKLALQA-------

Query:  ----------------------AYDNPHA--------------VTNEMNDKLVARYSKEEVERPINQMFLTKAPGPDG-------------------FQQ
                                 N  A              VT +MN +L+  ++ EEV+  +NQM  TK+PGPDG                      
Subjt:  ----------------------AYDNPHA--------------VTNEMNDKLVARYSKEEVERPINQMFLTKAPGPDG-------------------FQQ

Query:  SFINIIGELWE---VGDYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIGKDGLVAVKLDLSKAYNRVEW
        +FI +I +  +   +  +RPISLCNV+YK+++K +ANRLK + D +IS +Q AF+P R I DNI+I +E +HYL+ +R GK+G +++KLD+ KAY+RVEW
Subjt:  SFINIIGELWE---VGDYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIGKDGLVAVKLDLSKAYNRVEW

Query:  SFLIAIMARLGFD----------LRVISY--------------SEGMR----------------------------------------------------
        SF+ ++M+R+GF+          ++ +S+              S G+R                                                    
Subjt:  SFLIAIMARLGFD----------LRVISY--------------SEGMR----------------------------------------------------

Query:  -------------------FRIADGKSIR------MFLNPWIPKEITFKPRCLDERNT---------------CNKGKKLIPHNLRSNWIH---SYYEEF
                           + +A G+ I       +F    IP+           +N+                  G+ LI  N  +N I+   S Y  F
Subjt:  -------------------FRIADGKSIR------MFLNPWIPKEITFKPRCLDERNT---------------CNKGKKLIPHNLRSNWIH---SYYEEF

Query:  E-------------ASLKRNL-----SLKSVDHDSFK---------IQAKC-WIPPPSGFSKLNTDIAWSPSPSSTILGLVIQDDCGSISAISASHLPID
        +             A L R +     ++K     S+K         I+ +C W  PPSG  KLN D A+S + S    G +++D  G +  +SA+   + 
Subjt:  E-------------ASLKRNL-----SLKSVDHDSFK---------IQAKC-WIPPPSGFSKLNTDIAWSPSPSSTILGLVIQDDCGSISAISASHLPID

Query:  FTPPL-VEVLAITDGLKLALSLEKRWLIVESDSLQAINLIVRKAEAEGEVSYWIEEICYLKDKFDYCLLGHVSRSCNLFVDSIAKW
            + VE LAI  GL+  + L    LI+ESDSL  +     +  ++      I E   L  +F  C + HVSRS N    S+AK+
Subjt:  FTPPL-VEVLAITDGLKLALSLEKRWLIVESDSLQAINLIVRKAEAEGEVSYWIEEICYLKDKFDYCLLGHVSRSCNLFVDSIAKW

TrEMBL top hitse value%identityAlignment
A0A1R3HHY9 Reverse transcriptase3.2e-3751.27Show/hide
Query:  HAVTNEMNDKLVARYSKEEVERPINQMFLTKAPGPDGFQQSFINIIGELWE---VGDYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAI
        H VT EMN  L++ ++ EE+   +NQM  TKAPGPD   Q+ I +I ++ E   +  +RPISLCNV+YKI++K + NRLK+     ISENQSAF+P R I
Subjt:  HAVTNEMNDKLVARYSKEEVERPINQMFLTKAPGPDGFQQSFINIIGELWE---VGDYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAI

Query:  IDNIMIGHECIHYLKNKRIGKDGLVAVKLDLSKAYNRVEWSFLIAIMARLGFDLRVIS
         DNI++ +E +H L++ + GK G  A+KLD+SKAY+RVEW FL A+M RLGFD R ++
Subjt:  IDNIMIGHECIHYLKNKRIGKDGLVAVKLDLSKAYNRVEWSFLIAIMARLGFDLRVIS

A0A1U8JWA5 uncharacterized protein LOC1079111877.7e-3935.97Show/hide
Query:  IEVRFTSG-SNSFW-ELFNSVSILHLDWLFSNHHSIMKVLSLGRSNGKRKNWPFKFEEFWTRDDECKSIIEKAGLWESNHCQF-----SLKERLNRCSKS
        I+ R   G +N+ W  LF  V + HL   FSNH  ++  L   +   +  N  F+FE +W  ++    I E   LWE++  +       LKE L   ++ 
Subjt:  IEVRFTSG-SNSFW-ELFNSVSILHLDWLFSNHHSIMKVLSLGRSNGKRKNWPFKFEEFWTRDDECKSIIEKAGLWESNHCQF-----SLKERLNRCSKS

Query:  LKVSRRGKNKNLKSR----------------IMECKLALQAAYDN----PHAVTNEMNDKLVARYSKEEVERPINQMFLTKAPGPDGFQQSFINIIGELW
        ++ +R+   + L  +                +++ K++L    +        +  E N +L A YS+EEV   +  M  TKAPG DGF  +F     + W
Subjt:  LKVSRRGKNKNLKSR----------------IMECKLALQAAYDN----PHAVTNEMNDKLVARYSKEEVERPINQMFLTKAPGPDGFQQSFINIIGELW

Query:  EV--GDYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIGKDGLVAVKLDLSKAYNRVEWSFLIAIMARLG
        ++   D+RPISLCNV+YKI+ K IANRL+L+  + I E QSAF+  R I +N+++ +E +H LK K++GK G +AVKLD+SKAY+RVEW F+  IM ++G
Subjt:  EV--GDYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIGKDGLVAVKLDLSKAYNRVEWSFLIAIMARLG

Query:  FDL
        FDL
Subjt:  FDL

A0A2N9HTH6 Reverse transcriptase domain-containing protein1.9e-3734.34Show/hide
Query:  FNSVSILHLDWLFSNHHSIMKVLSLGRSNGKRKNWPFKFEEFWTRDDECKSIIEKAGLW----ESNHCQFSLKERLNRCSKSLKVSRRGKNKNLKSRIME
        +    + HL  L S+H  ++  +    +  +RK   F FE  W +D++CKS+IE A  W    +     F + E+L  C  +L      K  N  SRI  
Subjt:  FNSVSILHLDWLFSNHHSIMKVLSLGRSNGKRKNWPFKFEEFWTRDDECKSIIEKAGLW----ESNHCQFSLKERLNRCSKSLKVSRRGKNKNLKSRIME

Query:  CKLALQ-AAYDNPHA----------------------------------VTNEMNDKLVARYSKEEVERPINQMFLTKAPGPDG----FQQSFINIIG--
         +  LQ    ++P                                    VT  MN +LVA ++ EEV + + QM+ TKAPGPDG    F QS+  I+G  
Subjt:  CKLALQ-AAYDNPHA----------------------------------VTNEMNDKLVARYSKEEVERPINQMFLTKAPGPDG----FQQSFINIIG--

Query:  ------------------------------ELWEVGDYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIG
                                      E  ++ DYRPI+LCNVIYKIV+K +ANRLK +   +IS  QSAF+P R I DN+++  E +H +  K  G
Subjt:  ------------------------------ELWEVGDYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIG

Query:  KDGLVAVKLDLSKAYNRVEWSFLIAIMARLGF
        K G +A+KLD+SKAY+RVEW F+ A+M RLGF
Subjt:  KDGLVAVKLDLSKAYNRVEWSFLIAIMARLGF

A0A2N9IXL7 Uncharacterized protein9.4e-3736.18Show/hide
Query:  SNSFWELFNSVSILHLDWLFSNHHSIMKVLSLGRSNG-----KRKNWPFKFEEFWTRDDECKSIIEKAGLWE---SNHCQFSLKERLNRCSKSLKVSRRG
        S  +  LF   +I HL    S+H  ++  L+   ++G     +R+   F+FE+ W R+  C+ +I  AG W         + + E++ +C  +L    + 
Subjt:  SNSFWELFNSVSILHLDWLFSNHHSIMKVLSLGRSNG-----KRKNWPFKFEEFWTRDDECKSIIEKAGLWE---SNHCQFSLKERLNRCSKSLKVSRRG

Query:  KNKNLKSRIMECKLALQAAYD------NPHA-----------VTNEMNDKLVARYSKEEVERPINQMFLTKAPGPDG-FQQSFINIIGELW---EVGDYR
         +   +S   E +    A ++      NP A           VT  MN++L+  + KEEV++ + QM+ +KAPGPDG    + I +I ++    ++  +R
Subjt:  KNKNLKSRIMECKLALQAAYD------NPHA-----------VTNEMNDKLVARYSKEEVERPINQMFLTKAPGPDG-FQQSFINIIGELW---EVGDYR

Query:  PISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIGKDGLVAVKLDLSKAYNRVEWSFLIAIMARLGF
        PISLCNV+YKI +K + NR+K +  ++ISE+QSAF+P R I DN++I  E IHYLKN R G +  +A KLD+SKAY+RVEW +L AIM +LGF
Subjt:  PISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIGKDGLVAVKLDLSKAYNRVEWSFLIAIMARLGF

A0A803PVP3 Uncharacterized protein4.2e-3751.97Show/hide
Query:  VTNEMNDKLVARYSKEEVERPINQMFLTKAPGPDG-----FQQSFINIIGELWEVGDYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAI
        VT EMND L A+ S++EV + +  M   K+PGPDG     +Q+ + +   +   +GD RPI+LCNV++K++TK +ANR+K + D+I++ NQSAFIP R I
Subjt:  VTNEMNDKLVARYSKEEVERPINQMFLTKAPGPDG-----FQQSFINIIGELWEVGDYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAI

Query:  IDNIMIGHECIHYLKNKRIGKDGLVAVKLDLSKAYNRVEWSFLIAIMARLGF
        IDNI++  E +HYLK KR GKDG +A+KLD+SKAY+RVEW FL AI+ ++GF
Subjt:  IDNIMIGHECIHYLKNKRIGKDGLVAVKLDLSKAYNRVEWSFLIAIMARLGF

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.8e-0528Show/hide
Query:  DYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIGKDGLVAVKLDLSKAYNRVEWSFLIAIMARLGFD---
        ++RPISL N+  KI+ K +ANR++    ++I  +Q  FIP      NI      I ++   R      V + +D  KA+++++  F++  + +LG D   
Subjt:  DYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIGKDGLVAVKLDLSKAYNRVEWSFLIAIMARLGFD---

Query:  LRVIS--YSEGMRFRIADGKSIRMF
        L++I   Y +     I +G+ +  F
Subjt:  LRVIS--YSEGMRFRIADGKSIRMF

P08548 LINE-1 reverse transcriptase homolog8.8e-0825.11Show/hide
Query:  YDNPHAVTNEMNDKLVARYSKEEVE---RPINQMFLT---------KAPGPDGFQQSF------------INIIGELWEVG-------------------
        Y+N   +   +    + R S++EVE   RPI+   +          K+PGPDGF   F            +N+   + + G                   
Subjt:  YDNPHAVTNEMNDKLVARYSKEEVE---RPINQMFLT---------KAPGPDGFQQSF------------INIIGELWEVG-------------------

Query:  ------DYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIGKDGLVAVKLDLSKAYNRVEWSFLIAIMARL
              +YRPISL N+  KI+ K + NR++    +II  +Q  FIP      NI      I ++ NK   KD ++ + +D  KA++ ++  F+I  + ++
Subjt:  ------DYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIGKDGLVAVKLDLSKAYNRVEWSFLIAIMARL

Query:  GFD---LRVIS--YSEGMRFRIADGKSIRMF
        G +   L++I   YS+     I +G  ++ F
Subjt:  GFD---LRVIS--YSEGMRFRIADGKSIRMF

P11369 LINE-1 retrotransposable element ORF2 protein2.5e-1029.38Show/hide
Query:  DKLVARYSKEEVERPINQMFLTKAPGPDGFQQSF--------INIIGELW-----------------------------EVGDYRPISLCNVIYKIVTKT
        D L +  S +E+E  IN +   K+PGPDGF   F        I I+ +L+                             ++ ++RPISL N+  KI+ K 
Subjt:  DKLVARYSKEEVERPINQMFLTKAPGPDGFQQSF--------INIIGELW-----------------------------EVGDYRPISLCNVIYKIVTKT

Query:  IANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIGKDGLVAVKLDLSKAYNRVEWSFLIAIMARLG
        +ANR++     II  +Q  FIP      NI      IHY+ NK   K+ ++ + LD  KA+++++  F+I ++ R G
Subjt:  IANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIGKDGLVAVKLDLSKAYNRVEWSFLIAIMARLG

P14381 Transposon TX1 uncharacterized 149 kDa protein3.8e-1132.46Show/hide
Query:  GELWEVGDYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIGKDGLVAVKLDLSKAYNRVEWSFLIAIMAR
        G+L  + ++RP+SL +  YKIV K I+ RLK +  E+I  +QS  +P R I DN+ +  + +H+ +   +    L  + LD  KA++RV+  +LI  +  
Subjt:  GELWEVGDYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIGKDGLVAVKLDLSKAYNRVEWSFLIAIMAR

Query:  LGFDLRVISYSEGM
          F  + + Y + M
Subjt:  LGFDLRVISYSEGM

Q03279 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)5.9e-0429.63Show/hide
Query:  MFLTKAPGP-DGFQQSFINIIGELWEVGDYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIGKDGLVAVK
        +F  + P P  G +  F   I    + G +RP+S+C+VI +   K +A R    +     E Q+A++P   +  N+ +    I   + KR+ K+  +A+ 
Subjt:  MFLTKAPGP-DGFQQSFINIIGELWEVGDYRPISLCNVIYKIVTKTIANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIGKDGLVAVK

Query:  LDLSKAYNRVEWSFLIAIMARLGFDLRVISYSEGM
        LDL KA+N V  S LI  +   G    V+ Y   M
Subjt:  LDLSKAYNRVEWSFLIAIMARLGFDLRVISYSEGM

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.1e-0837.18Show/hide
Query:  IANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIGKDGLVAVKLDLSKAYNRVEWSFLIAIMARLGF
        +  RLK +   +I   Q++FIP R   DNI+   E +H ++ K+ G  G + +KLDL KAY+R+ W +L   +   GF
Subjt:  IANRLKLIFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIGKDGLVAVKLDLSKAYNRVEWSFLIAIMARLGF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTATCAACTTGTTGATATGGGGTTTTGTGGCCCATGTTTCACTTGGAATGGAAATAGAGGTGCGATTCACATCTGGCTCAAACTCTTTCTGGGAACTCTTCAACTC
TGTATCTATTCTACACTTGGATTGGTTATTTTCAAACCATCATTCAATTATGAAAGTTCTCAGTCTGGGAAGAAGTAATGGTAAAAGGAAGAATTGGCCTTTCAAATTTG
AGGAATTCTGGACCCGTGATGATGAGTGCAAATCCATTATTGAGAAGGCTGGTTTATGGGAGTCCAACCACTGTCAGTTCTCTCTAAAGGAGAGATTGAATAGATGTTCA
AAATCCTTAAAAGTTTCACGTAGGGGGAAAAATAAAAACTTGAAATCAAGAATAATGGAATGTAAATTGGCCCTCCAAGCAGCCTATGATAATCCACATGCGGTAACTAA
TGAGATGAATGACAAGTTGGTTGCTCGATACTCTAAAGAAGAAGTGGAGAGACCCATCAATCAAATGTTCCTCACCAAGGCCCCAGGTCCTGATGGGTTCCAACAATCTT
TTATCAACATTATTGGAGAATTGTGGGAGGTAGGGGACTATAGACCAATTAGTCTTTGTAATGTGATCTACAAGATTGTAACAAAGACTATTGCTAATCGTCTCAAACTG
ATCTTTGATGAGATAATTTCTGAGAATCAATCAGCATTCATACCTAGGAGAGCAATTATTGATAATATAATGATTGGCCATGAATGTATCCATTATCTCAAGAATAAAAG
AATAGGGAAAGATGGTCTGGTGGCTGTGAAGTTGGACTTGAGTAAAGCATACAATAGAGTTGAGTGGTCTTTCCTTATAGCTATTATGGCTCGTTTGGGTTTTGATTTGA
GAGTTATCTCTTATTCAGAGGGAATGAGATTTAGAATTGCTGATGGAAAATCTATTAGAATGTTTCTAAACCCCTGGATCCCAAAAGAGATCACATTTAAGCCAAGATGC
CTTGATGAGAGGAATACTTGTAACAAGGGCAAGAAGCTTATCCCTCACAATTTAAGAAGCAATTGGATCCATTCCTACTATGAGGAATTTGAAGCCTCATTGAAGAGAAA
CTTGTCTTTGAAGTCTGTAGATCATGACAGTTTCAAAATCCAAGCAAAATGTTGGATCCCTCCCCCTTCGGGCTTCTCCAAATTAAATACGGACATTGCATGGAGTCCAT
CTCCCTCTTCTACTATCCTTGGATTAGTCATCCAAGACGATTGTGGCTCCATTTCAGCAATTTCTGCTTCTCATCTCCCTATAGATTTTACTCCCCCTTTAGTTGAAGTT
TTAGCAATCACAGATGGGCTGAAGTTGGCCTTATCTCTTGAGAAGAGATGGTTGATTGTGGAGTCTGATTCACTTCAGGCCATTAATCTGATCGTGAGGAAAGCGGAAGC
CGAAGGTGAAGTTAGCTATTGGATTGAAGAAATTTGCTATTTGAAGGATAAATTTGACTATTGTTTGCTTGGTCATGTGTCACGCTCTTGTAATTTGTTTGTTGATTCAA
TTGCCAAATGGGCAAAATTAGATGAGATAAATGTTGTATGA
mRNA sequenceShow/hide mRNA sequence
ATGACTATCAACTTGTTGATATGGGGTTTTGTGGCCCATGTTTCACTTGGAATGGAAATAGAGGTGCGATTCACATCTGGCTCAAACTCTTTCTGGGAACTCTTCAACTC
TGTATCTATTCTACACTTGGATTGGTTATTTTCAAACCATCATTCAATTATGAAAGTTCTCAGTCTGGGAAGAAGTAATGGTAAAAGGAAGAATTGGCCTTTCAAATTTG
AGGAATTCTGGACCCGTGATGATGAGTGCAAATCCATTATTGAGAAGGCTGGTTTATGGGAGTCCAACCACTGTCAGTTCTCTCTAAAGGAGAGATTGAATAGATGTTCA
AAATCCTTAAAAGTTTCACGTAGGGGGAAAAATAAAAACTTGAAATCAAGAATAATGGAATGTAAATTGGCCCTCCAAGCAGCCTATGATAATCCACATGCGGTAACTAA
TGAGATGAATGACAAGTTGGTTGCTCGATACTCTAAAGAAGAAGTGGAGAGACCCATCAATCAAATGTTCCTCACCAAGGCCCCAGGTCCTGATGGGTTCCAACAATCTT
TTATCAACATTATTGGAGAATTGTGGGAGGTAGGGGACTATAGACCAATTAGTCTTTGTAATGTGATCTACAAGATTGTAACAAAGACTATTGCTAATCGTCTCAAACTG
ATCTTTGATGAGATAATTTCTGAGAATCAATCAGCATTCATACCTAGGAGAGCAATTATTGATAATATAATGATTGGCCATGAATGTATCCATTATCTCAAGAATAAAAG
AATAGGGAAAGATGGTCTGGTGGCTGTGAAGTTGGACTTGAGTAAAGCATACAATAGAGTTGAGTGGTCTTTCCTTATAGCTATTATGGCTCGTTTGGGTTTTGATTTGA
GAGTTATCTCTTATTCAGAGGGAATGAGATTTAGAATTGCTGATGGAAAATCTATTAGAATGTTTCTAAACCCCTGGATCCCAAAAGAGATCACATTTAAGCCAAGATGC
CTTGATGAGAGGAATACTTGTAACAAGGGCAAGAAGCTTATCCCTCACAATTTAAGAAGCAATTGGATCCATTCCTACTATGAGGAATTTGAAGCCTCATTGAAGAGAAA
CTTGTCTTTGAAGTCTGTAGATCATGACAGTTTCAAAATCCAAGCAAAATGTTGGATCCCTCCCCCTTCGGGCTTCTCCAAATTAAATACGGACATTGCATGGAGTCCAT
CTCCCTCTTCTACTATCCTTGGATTAGTCATCCAAGACGATTGTGGCTCCATTTCAGCAATTTCTGCTTCTCATCTCCCTATAGATTTTACTCCCCCTTTAGTTGAAGTT
TTAGCAATCACAGATGGGCTGAAGTTGGCCTTATCTCTTGAGAAGAGATGGTTGATTGTGGAGTCTGATTCACTTCAGGCCATTAATCTGATCGTGAGGAAAGCGGAAGC
CGAAGGTGAAGTTAGCTATTGGATTGAAGAAATTTGCTATTTGAAGGATAAATTTGACTATTGTTTGCTTGGTCATGTGTCACGCTCTTGTAATTTGTTTGTTGATTCAA
TTGCCAAATGGGCAAAATTAGATGAGATAAATGTTGTATGA
Protein sequenceShow/hide protein sequence
MTINLLIWGFVAHVSLGMEIEVRFTSGSNSFWELFNSVSILHLDWLFSNHHSIMKVLSLGRSNGKRKNWPFKFEEFWTRDDECKSIIEKAGLWESNHCQFSLKERLNRCS
KSLKVSRRGKNKNLKSRIMECKLALQAAYDNPHAVTNEMNDKLVARYSKEEVERPINQMFLTKAPGPDGFQQSFINIIGELWEVGDYRPISLCNVIYKIVTKTIANRLKL
IFDEIISENQSAFIPRRAIIDNIMIGHECIHYLKNKRIGKDGLVAVKLDLSKAYNRVEWSFLIAIMARLGFDLRVISYSEGMRFRIADGKSIRMFLNPWIPKEITFKPRC
LDERNTCNKGKKLIPHNLRSNWIHSYYEEFEASLKRNLSLKSVDHDSFKIQAKCWIPPPSGFSKLNTDIAWSPSPSSTILGLVIQDDCGSISAISASHLPIDFTPPLVEV
LAITDGLKLALSLEKRWLIVESDSLQAINLIVRKAEAEGEVSYWIEEICYLKDKFDYCLLGHVSRSCNLFVDSIAKWAKLDEINVV