CuGenDBv2

Lag0000274 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0000274
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr4:2739537..2742995
RNA-Seq Expression: Lag0000274
Synteny: Lag0000274
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR026960 - Reverse transcriptase zinc-binding domain
  IPR036397 - Ribonuclease H superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
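
The genome location given for this gene can be split into its parts programmatically. The following is a minimal sketch; it assumes the `chr:start..end` string uses 1-based inclusive coordinates (a common convention, but not stated on this page), and `parse_locus` and `span_length` are hypothetical helper names.

```python
import re


def parse_locus(locus: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", locus)
    if match is None:
        raise ValueError(f"unrecognised locus string: {locus!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_locus("chr4:2739537..2742995")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 3,459 bp on chr4.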


Homology
GenBank top hits | e-value | % identity | Alignment
CAB4303756.1 unnamed protein product [Prunus armeniaca] | 8.9e-37 | 29.84
Query:  WNEEFIRGMFWLMDAEDILDIPLASTPSKDDIIWALDNKGTFSVKSTYHLASNL-------REGLKESSSNDNSSRSFWKKLWNIKIIPRAKFCVWRVIN
        W ++ I   F   +A  IL IP++     D +IW     G ++V+S Y LA +L       R+G  E+SS+ + S S WK +W++   P+ K  +WR  +
Subjt:  WNEEFIRGMFWLMDAEDILDIPLASTPSKDDIIWALDNKGTFSVKSTYHLASNL-------REGLKESSSNDNSSRSFWKKLWNIKIIPRAKFCVWRVIN

Query:  NALPTKLNLIKIGMVVNPFCVFCRCKEESSSHVVWKCKFSKLSWMNFLPNLDSLFSINREDWRPIDWWEGMMVILSEEERKKAAII-----MWNIWAYRN
        N L  + NL +  +     C FC  ++E+ +H+ ++C F++  W      LD +  +  ED+  I  W+ MM  L+  E    AI      +W IW  RN
Subjt:  NALPTKLNLIKIGMVVNPFCVFCRCKEESSSHVVWKCKFSKLSWMNFLPNLDSLFSINREDWRPIDWWEGMMVILSEEERKKAAII-----MWNIWAYRN

Query:  KILMEGSTADKDQFLRPIDVSIKEH--CARIFRDSDLLDISSKSPTSHMCWVPPSSEQLKLNVDASWSDDCKAGGIGWMIRDSLGSLVAAGNKSVSRKWS
          + +G  AD  + +  +   + E     +  +   L  +SS +P S   W  P    LK+N DA+WS     GG+GW+IRDS G L+ AG +      S
Subjt:  KILMEGSTADKDQFLRPIDVSIKEH--CARIFRDSDLLDISSKSPTSHMCWVPPSSEQLKLNVDASWSDDCKAGGIGWMIRDSLGSLVAAGNKSVSRKWS

Query:  IPSLEALTIKEGLTSYLNLGLNAPLVVESDSASVVKSLNNPNLDLFETHLIMKDIFIWARNCAEISFSWCCRERNGVANALA
           +E L I+  L+   N  L   ++VESDS   +  LN       +   I+ DI     + + +SF +  R  N  A+++A
Subjt:  IPSLEALTIKEGLTSYLNLGLNAPLVVESDSASVVKSLNNPNLDLFETHLIMKDIFIWARNCAEISFSWCCRERNGVANALA

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis] | 8.6e-40 | 30.83
Query:  MDGQGHWNEEFIRGMFWLMDAEDILDIPLASTPSKDDIIWALDNKGTFSVKSTYHLASNLREGLKESSSNDNSSRSFWKKLWNIKIIPRAKFCVWRVINN
        +D +  W  + +   F   D E IL I L S   +D+++W  D KG +SVKS Y LA  L +       + NSS   WK  W + +  + K  +WR + N
Subjt:  MDGQGHWNEEFIRGMFWLMDAEDILDIPLASTPSKDDIIWALDNKGTFSVKSTYHLASNLREGLKESSSNDNSSRSFWKKLWNIKIIPRAKFCVWRVINN

Query:  ALPTKLNLIKIGMVVNPFCVFCRCKEESSSHVVWKCKFSKLSW----MNFLPNLD---SLFSINREDWRPIDWWEGMMVILSEEERKKAAIIMWNIWAYR
         LPT  NL K   +  P C  C+ + E+ SHV+ +CK ++  W    +   P+ D     FS  +E W      E  ++I          +  W IW+ R
Subjt:  ALPTKLNLIKIGMVVNPFCVFCRCKEESSSHVVWKCKFSKLSW----MNFLPNLD---SLFSINREDWRPIDWWEGMMVILSEEERKKAAIIMWNIWAYR

Query:  NKILMEGSTADKDQFLRPIDVSIKEHCARIFRDSDLLDISSKSPTSHMCWVPPSSEQLKLNVDASWSDDCKAGGIGWMIRDSLGSLVAAGNKSVSRKWSI
        NK + EG  +D  +FL     S+ +   R+ +  ++     +       W PPS   LKLNVDA+ S   +  G+G ++RD+ G ++A G K    +  +
Subjt:  NKILMEGSTADKDQFLRPIDVSIKEHCARIFRDSDLLDISSKSPTSHMCWVPPSSEQLKLNVDASWSDDCKAGGIGWMIRDSLGSLVAAGNKSVSRKWSI

Query:  PSLEALTIKEGLTSYLNLGLNAPLVVESDSASVVKSLNNPNLDLFETHLIMKDIFIWARNCAEISFSWCCRERNGVANALARNAVQ
           EA  I  GL    N   ++ L+VESD   VV+ LNN      E H I+ D+   ++   ++ FS+  R  N  A+ALA+ A++
Subjt:  PSLEALTIKEGLTSYLNLGLNAPLVVESDSASVVKSLNNPNLDLFETHLIMKDIFIWARNCAEISFSWCCRERNGVANALARNAVQ

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia] | 1.1e-47 | 36.8
Query:  FPPPQPFPQNPMPNLVQYPNPFNPNSYLTLPQPLSMKLNDTNFLLWRNQLLNAMIANGLQGYLDGSIAAPPRYLDDQQLQPNPDFLHWERYNRFIKCWIY
        FPPP P        L Q PNPF+ N + TLPQPL++KLND NFLLW+NQLLNA+IANGL+GYLDG+I  PP++LD  QLQPNP +  WERYNR + CWIY
Subjt:  FPPPQPFPQNPMPNLVQYPNPFNPNSYLTLPQPLSMKLNDTNFLLWRNQLLNAMIANGLQGYLDGSIAAPPRYLDDQQLQPNPDFLHWERYNRFIKCWIY

Query:  SSLSVEKMSEIVSFESAADIWNSLKRSYDSKTTTRIMGLKTQLQKIKR----------------------------MDNLAY------------------
        SSLS EKM E+VS E+  DIW+SL R YDSKTT RIMGLKT+LQ +++                             D+LA+                  
Subjt:  SSLSVEKMSEIVSFESAADIWNSLKRSYDSKTTTRIMGLKTQLQKIKR----------------------------MDNLAY------------------

Query:  ----------------------------------------------QTPPP----------------------QALLST-----TFTPTPSS--------
                                                      + PPP                      Q++L        + P PSS        
Subjt:  ----------------------------------------------QTPPP----------------------QALLST-----TFTPTPSS--------

Query:  ---------------------VPDTLSTMFTDS----------YHLDKNWFLDSGATHHMTHDASNLYNLTPYNGGEQVIVGNGSSIQASSPPG
                              P  L      S           H D++WF+DSGATHHMT D+S L N TPY+GGEQV VGNGSS+Q  S  G
Subjt:  ---------------------VPDTLSTMFTDS----------YHLDKNWFLDSGATHHMTHDASNLYNLTPYNGGEQVIVGNGSSIQASSPPG

XP_023904177.1 uncharacterized protein LOC112015942 [Quercus suber] | 1.8e-37 | 29.63
Query:  WNEEFIRGMFWLMDAEDILDIPLASTPSKDDIIWALDNKGTFSVKSTYHLASN-LREGLKESSSNDNSSRSFWKKLWNIKIIPRAKFCVWRVINNALPTK
        W  + I   F   DA+ I  IPL+     D IIW  +  G +SVKS Y++A   LR       S  N  +  WK LW + +  + K   WR     LPT+
Subjt:  WNEEFIRGMFWLMDAEDILDIPLASTPSKDDIIWALDNKGTFSVKSTYHLASN-LREGLKESSSNDNSSRSFWKKLWNIKIIPRAKFCVWRVINNALPTK

Query:  LNLIKIGMVVNPFCVFCRCKEESSSHVVWKCKFSKLSWMNFLPNLDSLFSINREDWRPIDWWEGMMVILSEEERKKAAIIM---WNIWAYRNKILMEGST
        LNL K  ++   FC  C    E+  HV+W+C  ++  W        S   + +      D  +  + +L   E +  A+ +   W +W+ RN +L  G  
Subjt:  LNLIKIGMVVNPFCVFCRCKEESSSHVVWKCKFSKLSWMNFLPNLDSLFSINREDWRPIDWWEGMMVILSEEERKKAAIIM---WNIWAYRNKILMEGST

Query:  ADKDQFLRPIDVSIKEHCARIFRDSDLLDISSKSPTSHMCWVPPSSEQLKLNVDASWSDDCKAGGIGWMIRDSLGSLVAAGNKSVSRKWSIPSLEALTIK
          +  +L   DV   E  A++    D LD+     T    W PP+    K+N DA+  +D    G G +IR+ LG ++AA +       S    E L  +
Subjt:  ADKDQFLRPIDVSIKEHCARIFRDSDLLDISSKSPTSHMCWVPPSSEQLKLNVDASWSDDCKAGGIGWMIRDSLGSLVAAGNKSVSRKWSIPSLEALTIK

Query:  EGLTSYLNLGLNAPLVVESDSASVVKSLNNPNLDLFETHLIMKDIFIWARNCAEISFSWCCRERNGVANALARNAVQL
        + +   ++ G N  L++E DS +V+K+L++ + DL     ++ DI         +SFSW  R+ N VA+ALA+ A  L
Subjt:  EGLTSYLNLGLNAPLVVESDSASVVKSLNNPNLDLFETHLIMKDIFIWARNCAEISFSWCCRERNGVANALARNAVQL

XP_027118368.1 uncharacterized protein LOC113735569 [Coffea arabica] | 1.8e-37 | 27.53
Query:  MDGQGH-WNEEFIRGMFWLMDAEDILDIPLASTPSKDDIIWALDNKGTFSVKSTYHLASNLREGLKESSSNDNSS-RSFWKKLWNIKIIPRAKFCVWRVI
        +DG    W  E ++ +FW  + E I+ IPL  + + D +IW     G+F+VKS YHL      G ++ +S  N S +  W +LW + I  + K  +WR+ 
Subjt:  MDGQGH-WNEEFIRGMFWLMDAEDILDIPLASTPSKDDIIWALDNKGTFSVKSTYHLASNLREGLKESSSNDNSS-RSFWKKLWNIKIIPRAKFCVWRVI

Query:  NNALPTKLNLIKIGMVVNPFCVFCRCKEESSSHVVWKCKFSKLSW-MNFLP---NLDSLFSINREDWRPIDWWEGMMVILSEEERKKAAIIMWNIWAYRN
           LPT   L   G+ ++P C+ C+   E  +H+   C  +  +W ++FL     LDS   ++        W  G++ +L +++ +  AII+WN+W  RN
Subjt:  NNALPTKLNLIKIGMVVNPFCVFCRCKEESSSHVVWKCKFSKLSW-MNFLP---NLDSLFSINREDWRPIDWWEGMMVILSEEERKKAAIIMWNIWAYRN

Query:  KILMEGSTADKDQFLRPID-VSIKEHCARIFRDSDLL-DISSKSPTSHMCWVPPSSEQLKLNVDASWSDDCKAGGIGWMIRDSLGSLVAAGNKSVSRKWS
          L +GS  D      P+  VS+  +  + +R+++   +   ++  + + W  P    LK N D +   +    G+G ++RD  G+ V   +  +S  +S
Subjt:  KILMEGSTADKDQFLRPID-VSIKEHCARIFRDSDLL-DISSKSPTSHMCWVPPSSEQLKLNVDASWSDDCKAGGIGWMIRDSLGSLVAAGNKSVSRKWS

Query:  IPSLEALTIKEGLTSYLNLGLNAPLVVESDSASVVKSLNNPNLDLFETHLIMKDIFIWARNCAEISFSWCCRERNGVANALARNA
           +EA   +  ++  L   +   +++E DS  +V  L     D     +++ DIF+  +N A    +W  R+ N  A+ LARNA
Subjt:  IPSLEALTIKEGLTSYLNLGLNAPLVVESDSASVVKSLNNPNLDLFETHLIMKDIFIWARNCAEISFSWCCRERNGVANALARNA

TrEMBL top hits | e-value | % identity | Alignment
A0A2N9GAE6 Uncharacterized protein | 1.0e-38 | 28.49
Query:  WNEEFIRGMFWLMDAEDILDIPLASTPSKDDIIWALDNKGTFSVKSTYHLASNLREGLKESSSNDNSSRSFWKKLWNIKIIPRAKFCVWRVINNALPTKL
        W+   I  +F   DAE I  IPL++    D IIW  +  G ++V+S Y    +  +      S  N  +  W+ +W++KI  + +   W+    ALPTKL
Subjt:  WNEEFIRGMFWLMDAEDILDIPLASTPSKDDIIWALDNKGTFSVKSTYHLASNLREGLKESSSNDNSSRSFWKKLWNIKIIPRAKFCVWRVINNALPTKL

Query:  NLIKIGMVVNPFCVFCRCKEESSSHVVWKCKFSKLSWMNFLPNLDSLFSINREDWRPIDWWEGMMVILS---EEERKKAAIIMWNIWAYRNKILMEGSTA
        NL K  + V   C  C  K+E + H +W CK  +  W N       +++ + ++   +D+ + +  +L    + E +   II W +W  RNKI +     
Subjt:  NLIKIGMVVNPFCVFCRCKEESSSHVVWKCKFSKLSWMNFLPNLDSLFSINREDWRPIDWWEGMMVILS---EEERKKAAIIMWNIWAYRNKILMEGSTA

Query:  DKDQFLRPIDVSIKEHCARIFRDSDLLDISSKSPTSHMCWVPPSSEQLKLNVDASWSDDCKAGGIGWMIRDSLGSLVAAGNKSVSRKWSIPSLEALTIKE
          +Q    + +  K +      +++ L  S   P   + W+PP   + K+N D +   +    GIG ++RDS G ++A+  + V    S+PS+EA  +K 
Subjt:  DKDQFLRPIDVSIKEHCARIFRDSDLLDISSKSPTSHMCWVPPSSEQLKLNVDASWSDDCKAGGIGWMIRDSLGSLVAAGNKSVSRKWSIPSLEALTIKE

Query:  GLTSYLNLGLNAPLVVESDSASVVKSLNNPNLDLFETHLIMKDIFIWARNCAEISFSWCCRERNGVANALAR
         +   L +GL      E DS ++V +LN+P   L    L++ D  + A      SFS   R+ N +A+ALAR
Subjt:  GLTSYLNLGLNAPLVVESDSASVVKSLNNPNLDLFETHLIMKDIFIWARNCAEISFSWCCRERNGVANALAR

A0A2N9H727 Uncharacterized protein | 4.3e-37 | 27.89
Query:  MDGQGHWNEEFIRGMFWLMDAEDILDIPLASTPSKDDIIWALDNKGTFSVKSTYHLASNLREGLKESSSNDNSSRSFWKKLWNIKIIPRAKFCVWRVINN
        +D  G WN   I  +F   DA+ I  + L+S    D +IW+ +  GT+SV+S Y L            S++   + FWKK+W++++  + +  +WR    
Subjt:  MDGQGHWNEEFIRGMFWLMDAEDILDIPLASTPSKDDIIWALDNKGTFSVKSTYHLASNLREGLKESSSNDNSSRSFWKKLWNIKIIPRAKFCVWRVINN

Query:  ALPTKLNLIKIGMVVNPFCVFCRCKEESSSHVVWKCKFSKLSWMNFLPNLDSLFSINREDWRPIDWWEGMMVILSEEERKKAAIIMWNIWAYRNKILMEG
        +LPT  N+ +  +V+   C FC  +EE   HV+W C      W +   ++ +     R  +  +D    + V+ + E   +   + W +W  RN++ M  
Subjt:  ALPTKLNLIKIGMVVNPFCVFCRCKEESSSHVVWKCKFSKLSWMNFLPNLDSLFSINREDWRPIDWWEGMMVILSEEERKKAAIIMWNIWAYRNKILMEG

Query:  STADKDQFLRPIDVSIKEHCARIFRDSDLLDISSKSPTSHMCWVPPSSEQLKLNVDASWSDDCKAGGIGWMIRDSLGSLVAAGNKSVSRKWSIPSLEALT
        + AD  + +  + V    H A  F  + L   +S      + W P SS   K+N DA+   D +  G+G +IRD  G  +AA  K      S+   EAL 
Subjt:  STADKDQFLRPIDVSIKEHCARIFRDSDLLDISSKSPTSHMCWVPPSSEQLKLNVDASWSDDCKAGGIGWMIRDSLGSLVAAGNKSVSRKWSIPSLEALT

Query:  IKEGLTSYLNLGLNAPLVVESDSASVVKSLNNPNLDLFETHLIMKDIFIWARNCAEISFSWCCRERNGVANALARNAVQL
          E +   + +G+      E D+ ++  +L   +  L     I+ D+ ++AR     SFS   RE N VA+ LAR A++L
Subjt:  IKEGLTSYLNLGLNAPLVVESDSASVVKSLNNPNLDLFETHLIMKDIFIWARNCAEISFSWCCRERNGVANALARNAVQL

A0A6J1DQX7 uncharacterized protein LOC111022315 | 5.4e-48 | 36.8
Query:  FPPPQPFPQNPMPNLVQYPNPFNPNSYLTLPQPLSMKLNDTNFLLWRNQLLNAMIANGLQGYLDGSIAAPPRYLDDQQLQPNPDFLHWERYNRFIKCWIY
        FPPP P        L Q PNPF+ N + TLPQPL++KLND NFLLW+NQLLNA+IANGL+GYLDG+I  PP++LD  QLQPNP +  WERYNR + CWIY
Subjt:  FPPPQPFPQNPMPNLVQYPNPFNPNSYLTLPQPLSMKLNDTNFLLWRNQLLNAMIANGLQGYLDGSIAAPPRYLDDQQLQPNPDFLHWERYNRFIKCWIY

Query:  SSLSVEKMSEIVSFESAADIWNSLKRSYDSKTTTRIMGLKTQLQKIKR----------------------------MDNLAY------------------
        SSLS EKM E+VS E+  DIW+SL R YDSKTT RIMGLKT+LQ +++                             D+LA+                  
Subjt:  SSLSVEKMSEIVSFESAADIWNSLKRSYDSKTTTRIMGLKTQLQKIKR----------------------------MDNLAY------------------

Query:  ----------------------------------------------QTPPP----------------------QALLST-----TFTPTPSS--------
                                                      + PPP                      Q++L        + P PSS        
Subjt:  ----------------------------------------------QTPPP----------------------QALLST-----TFTPTPSS--------

Query:  ---------------------VPDTLSTMFTDS----------YHLDKNWFLDSGATHHMTHDASNLYNLTPYNGGEQVIVGNGSSIQASSPPG
                              P  L      S           H D++WF+DSGATHHMT D+S L N TPY+GGEQV VGNGSS+Q  S  G
Subjt:  ---------------------VPDTLSTMFTDS----------YHLDKNWFLDSGATHHMTHDASNLYNLTPYNGGEQVIVGNGSSIQASSPPG

A0A6J5WPU6 Reverse transcriptase domain-containing protein | 4.3e-37 | 29.84
Query:  WNEEFIRGMFWLMDAEDILDIPLASTPSKDDIIWALDNKGTFSVKSTYHLASNL-------REGLKESSSNDNSSRSFWKKLWNIKIIPRAKFCVWRVIN
        W ++ I   F   +A  IL IP++     D +IW     G ++V+S Y LA +L       R+G  E+SS+ + S S WK +W++   P+ K  +WR  +
Subjt:  WNEEFIRGMFWLMDAEDILDIPLASTPSKDDIIWALDNKGTFSVKSTYHLASNL-------REGLKESSSNDNSSRSFWKKLWNIKIIPRAKFCVWRVIN

Query:  NALPTKLNLIKIGMVVNPFCVFCRCKEESSSHVVWKCKFSKLSWMNFLPNLDSLFSINREDWRPIDWWEGMMVILSEEERKKAAII-----MWNIWAYRN
        N L  + NL +  +     C FC  ++E+ +H+ ++C F++  W      LD +  +  ED+  I  W+ MM  L+  E    AI      +W IW  RN
Subjt:  NALPTKLNLIKIGMVVNPFCVFCRCKEESSSHVVWKCKFSKLSWMNFLPNLDSLFSINREDWRPIDWWEGMMVILSEEERKKAAII-----MWNIWAYRN

Query:  KILMEGSTADKDQFLRPIDVSIKEH--CARIFRDSDLLDISSKSPTSHMCWVPPSSEQLKLNVDASWSDDCKAGGIGWMIRDSLGSLVAAGNKSVSRKWS
          + +G  AD  + +  +   + E     +  +   L  +SS +P S   W  P    LK+N DA+WS     GG+GW+IRDS G L+ AG +      S
Subjt:  KILMEGSTADKDQFLRPIDVSIKEH--CARIFRDSDLLDISSKSPTSHMCWVPPSSEQLKLNVDASWSDDCKAGGIGWMIRDSLGSLVAAGNKSVSRKWS

Query:  IPSLEALTIKEGLTSYLNLGLNAPLVVESDSASVVKSLNNPNLDLFETHLIMKDIFIWARNCAEISFSWCCRERNGVANALA
           +E L I+  L+   N  L   ++VESDS   +  LN       +   I+ DI     + + +SF +  R  N  A+++A
Subjt:  IPSLEALTIKEGLTSYLNLGLNAPLVVESDSASVVKSLNNPNLDLFETHLIMKDIFIWARNCAEISFSWCCRERNGVANALA

A0A6P6WSG1 uncharacterized protein LOC113735569 | 8.7e-38 | 27.53
Query:  MDGQGH-WNEEFIRGMFWLMDAEDILDIPLASTPSKDDIIWALDNKGTFSVKSTYHLASNLREGLKESSSNDNSS-RSFWKKLWNIKIIPRAKFCVWRVI
        +DG    W  E ++ +FW  + E I+ IPL  + + D +IW     G+F+VKS YHL      G ++ +S  N S +  W +LW + I  + K  +WR+ 
Subjt:  MDGQGH-WNEEFIRGMFWLMDAEDILDIPLASTPSKDDIIWALDNKGTFSVKSTYHLASNLREGLKESSSNDNSS-RSFWKKLWNIKIIPRAKFCVWRVI

Query:  NNALPTKLNLIKIGMVVNPFCVFCRCKEESSSHVVWKCKFSKLSW-MNFLP---NLDSLFSINREDWRPIDWWEGMMVILSEEERKKAAIIMWNIWAYRN
           LPT   L   G+ ++P C+ C+   E  +H+   C  +  +W ++FL     LDS   ++        W  G++ +L +++ +  AII+WN+W  RN
Subjt:  NNALPTKLNLIKIGMVVNPFCVFCRCKEESSSHVVWKCKFSKLSW-MNFLP---NLDSLFSINREDWRPIDWWEGMMVILSEEERKKAAIIMWNIWAYRN

Query:  KILMEGSTADKDQFLRPID-VSIKEHCARIFRDSDLL-DISSKSPTSHMCWVPPSSEQLKLNVDASWSDDCKAGGIGWMIRDSLGSLVAAGNKSVSRKWS
          L +GS  D      P+  VS+  +  + +R+++   +   ++  + + W  P    LK N D +   +    G+G ++RD  G+ V   +  +S  +S
Subjt:  KILMEGSTADKDQFLRPID-VSIKEHCARIFRDSDLL-DISSKSPTSHMCWVPPSSEQLKLNVDASWSDDCKAGGIGWMIRDSLGSLVAAGNKSVSRKWS

Query:  IPSLEALTIKEGLTSYLNLGLNAPLVVESDSASVVKSLNNPNLDLFETHLIMKDIFIWARNCAEISFSWCCRERNGVANALARNA
           +EA   +  ++  L   +   +++E DS  +V  L     D     +++ DIF+  +N A    +W  R+ N  A+ LARNA
Subjt:  IPSLEALTIKEGLTSYLNLGLNAPLVVESDSASVVKSLNNPNLDLFETHLIMKDIFIWARNCAEISFSWCCRERNGVANALARNA

SwissProt top hits | e-value | % identity | Alignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 4.8e-09 | 29.03
Query:  NPNSYLTLPQPLSMKLNDTNFLLWRNQLLNAMIANGLQGYLDGSIAAPPRYL-DDQQLQPNPDFLHWERYNRFIKCWIYSSLSVEKMSEIVSFESAADIW
        N  S L +      KL  TN+L+W  Q+        L G+LDGS   PP  +  D   + NPD+  W+R ++ I   +  ++S+     +    +AA IW
Subjt:  NPNSYLTLPQPLSMKLNDTNFLLWRNQLLNAMIANGLQGYLDGSIAAPPRYL-DDQQLQPNPDFLHWERYNRFIKCWIYSSLSVEKMSEIVSFESAADIW

Query:  NSLKRSYDSKTTTRIMGLKTQLQK
         +L++ Y + +   +  L+TQL++
Subjt:  NSLKRSYDSKTTTRIMGLKTQLQK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 2.5e-05 | 42.19
Query:  PTPSSVPDTLSTMFTDSYHLDKNWFLDSGATHHMTHDASNLYNLTPYNGGEQVIVGNGSSIQAS
        P+P +     + +   S +   NW LDSGATHH+T D +NL    PY GG+ V+V +GS+I  S
Subjt:  PTPSSVPDTLSTMFTDSYHLDKNWFLDSGATHHMTHDASNLYNLTPYNGGEQVIVGNGSSIQAS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 2.7e-04 | 36.84
Query:  YQTPPPQALLSTTFTPTPSSVPDTLSTMFTDSYHLDKNWFLDSGATHHMTHDASNLYNLTPYNGGEQVIVGNGSSI
        +Q+   Q   ++ FTP         + +  +S +   NW LDSGATHH+T D +NL    PY GG+ V++ +GS+I
Subjt:  YQTPPPQALLSTTFTPTPSSVPDTLSTMFTDSYHLDKNWFLDSGATHHMTHDASNLYNLTPYNGGEQVIVGNGSSI

Arabidopsis top hits | e-value | % identity | Alignment
AT1G27870.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 2.5e-05 | 27.78
Query:  SHMCWVPPSSEQLKLNVDASWSDDCKAGGIGWMIRDSLGSLVAAGNKSVSRKWS---IPSLEALTIKEGLTSYLNLGLNAPLVVESDSASVVKSLNNPNL
        +H  W  P    +K N D S+ +       GW++RDS GS + AG +++ RK        ++AL I   +    + G    +  E D+  +   +N   +
Subjt:  SHMCWVPPSSEQLKLNVDASWSDDCKAGGIGWMIRDSLGSLVAAGNKSVSRKWS---IPSLEALTIKEGLTSYLNLGLNAPLVVESDSASVVKSLNNPNL

Query:  DLFETHLIMKDIFIWARNCAEISFSWCCRERNGVANALARNAVQ
          F  H  ++DI  W R    I F+W  R  N  A+ LA+  +Q
Subjt:  DLFETHLIMKDIFIWARNCAEISFSWCCRERNGVANALARNAVQ

AT2G04420.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 7.8e-07 | 21.43
Query:  MWNIWAYRNKILMEGSTADKDQFLRPI--DVSIKEHCARIFRDSDLLDI---SSKSPTSHMCWVPPSSEQLKLNVDASWSDDCKAGGIGWMIRDSLGSLV
        MW +W  RN+++ +         L+    DV   E+     +  +   +    +   ++H  W  P    +K N D S++   +    GW+IRD  G   
Subjt:  MWNIWAYRNKILMEGSTADKDQFLRPI--DVSIKEHCARIFRDSDLLDI---SSKSPTSHMCWVPPSSEQLKLNVDASWSDDCKAGGIGWMIRDSLGSLV

Query:  AAGNKSVSRKWSIPSLEALTIKEGLTSYLNLGLNAPLVVESDSASVVKSLNNPNLDLFETHLIMKDIFIWARNCAEISFSWCCRERNGVANALARN
         A         +    E   +   +    + G    ++ E DS  V + LN   +  F     +++ + W++   E+ FSW  R  N  A+ LA++
Subjt:  AAGNKSVSRKWSIPSLEALTIKEGLTSYLNLGLNAPLVVESDSASVVKSLNNPNLDLFETHLIMKDIFIWARNCAEISFSWCCRERNGVANALARN

AT3G09510.1 Ribonuclease H-like superfamily protein | 2.5e-21 | 22.92
Query:  WNEEFIRGMFWLMDAEDILDIPLASTPSKDDIIWALDNKGTFSVKSTYHLASNLREGLKESSSNDNSSRSFWKKLWNIKIIPRAKFCVWRVINNALPTKL
        W++  I       D   I  I LA +   D IIW  +  G ++V+S Y L ++       + +  + S     ++WN+ I+P+ K  +WR ++ AL T  
Subjt:  WNEEFIRGMFWLMDAEDILDIPLASTPSKDDIIWALDNKGTFSVKSTYHLASNLREGLKESSSNDNSSRSFWKKLWNIKIIPRAKFCVWRVINNALPTKL

Query:  NLIKIGMVVNPFCVFCRCKEESSSHVVWKCKFSKLSWMNFLPNLDSLFSINREDWRPIDWWEGMMVIL--------SEEERKKAAIIMWNIWAYRNKILM
         L   GM ++P C  C  + ES +H ++ C F+ ++W        S  S+ R      D+ E +  IL        S+  +     ++W IW  RN ++ 
Subjt:  NLIKIGMVVNPFCVFCRCKEESSSHVVWKCKFSKLSWMNFLPNLDSLFSINREDWRPIDWWEGMMVIL--------SEEERKKAAIIMWNIWAYRNKILM

Query:  EGSTADKDQFLRPIDVSIKEHCARIFRDSDLLDISSKSPTSHMC-----WVPPSSEQLKLNVDASWSDDCKAGGIGWMIRDSLGSLVAAGNKSVSRKWSI
                + +  +    + H    + ++      + SPT  +      W  P +  +K N DA +         GW+IR+  G+ ++ G+  ++   + 
Subjt:  EGSTADKDQFLRPIDVSIKEHCARIFRDSDLLDISSKSPTSHMC-----WVPPSSEQLKLNVDASWSDDCKAGGIGWMIRDSLGSLVAAGNKSVSRKWSI

Query:  PSLEALTIKEGLTSYLNLGLNAPLVVESDSASVVKSLNNPNLDLFETHLI--MKDIFIWARNCAEISFSWCCRERNGVANALAR
           E   +   L      G    + +E D  +++  +N  +   F + L   ++DI  WA   A I F +  R+ N +A+ LA+
Subjt:  PSLEALTIKEGLTSYLNLGLNAPLVVESDSASVVKSLNNPNLDLFETHLI--MKDIFIWARNCAEISFSWCCRERNGVANALAR

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 9.2e-08 | 35
Query:  LWNIKIIPRAKFCVWRVINNALPTKLNLIKIGMVVNPFCVFCRCKEESSSHVVWKCKFSK
        +W++KI P+ K  +W+ +NNALP    L+   + + PFC  CR   E+ +H+++ C F++
Subjt:  LWNIKIIPRAKFCVWRVINNALPTKLNLIKIGMVVNPFCVFCRCKEESSSHVVWKCKFSK

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 4.3e-13 | 23.76
Query:  IMWNIWAYRNKILMEGSTADKDQFLRPIDVSI---KEHCARIFRDSDLLDISSKSPTSHMCWVPPSSEQLKLNVDASWSDDCKAGGIGWMIRDSLGSLVA
        +MW IW   N ++   +   + +F   +++++   KE       +       +  P+ +  W PP  ++LK N DAS  +     G+GW++R+S G+++ 
Subjt:  IMWNIWAYRNKILMEGSTADKDQFLRPIDVSI---KEHCARIFRDSDLLDISSKSPTSHMCWVPPSSEQLKLNVDASWSDDCKAGGIGWMIRDSLGSLVA

Query:  AGNKSVSRKWSIPSLEALTIKEGLTSYLNLGLNAPLVVESDSASVVKSLN----NPNLDLFETHLIMKDIFIWARNCAEISFSWCCRERNGVANALARNA
         G      + +    E  T+   + +    G +  ++ E D+ ++ + +N    NP L  F     +  I  W  +   I FS+  RE+NG A+ LA+ A
Subjt:  AGNKSVSRKWSIPSLEALTIKEGLTSYLNLGLNAPLVVESDSASVVKSLN----NPNLDLFETHLIMKDIFIWARNCAEISFSWCCRERNGVANALARNA

Query:  VQ
        ++
Subjt:  VQ
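
The identity value reported for each hit can be approximated from the middle line of its alignment, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below is illustrative only and is not the database's own calculation; it assumes identity is the number of identical columns divided by the aligned length, and `percent_identity` is a hypothetical helper. The two strings are copied from the AT3G26855.1 alignment above.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Identical columns (letters in the midline) as a percentage of the alignment."""
    identical = sum(ch.isalpha() for ch in midline)
    return round(100.0 * identical / aligned_length, 2)


# Query and middle line of the AT3G26855.1 alignment, in 10-column chunks.
query = (
    "LWNIKIIPRA" "KFCVWRVINN" "ALPTKLNLIK"
    "IGMVVNPFCV" "FCRCKEESSS" "HVVWKCKFSK"
)
midline = (
    "+W++KI P+ " "K  +W+ +NN" "ALP    L+ "
    "  + + PFC " " CR   E+ +" "H+++ C F++"
)

print(percent_identity(midline, len(query)))
```

For this hit the result is 35.0 (21 identical columns out of 60), which agrees with the identity of 35 reported for AT3G26855.1.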


Sequences
CDS sequence
ATGGATGGTCAGGGCCATTGGAATGAAGAGTTCATTAGAGGGATGTTTTGGTTGATGGATGCTGAGGATATTCTGGACATTCCTTTGGCCTCCACTCCCTCTAAGGATGA
TATTATTTGGGCCTTGGATAACAAGGGCACGTTCTCGGTGAAATCTACTTATCATCTTGCTAGCAACCTTAGGGAGGGGTTGAAGGAATCCTCGTCGAATGATAATTCCT
CTAGGTCCTTTTGGAAAAAGCTCTGGAATATTAAGATCATTCCTAGAGCTAAATTTTGTGTTTGGAGAGTTATCAATAATGCTCTCCCTACTAAACTTAATCTAATTAAG
ATAGGGATGGTTGTTAATCCGTTTTGTGTGTTTTGCAGGTGCAAAGAGGAATCCTCGTCTCATGTGGTGTGGAAATGTAAGTTCTCTAAGCTATCGTGGATGAATTTTCT
CCCTAACTTGGATAGTTTGTTCTCTATCAACAGGGAGGATTGGAGGCCAATTGACTGGTGGGAAGGAATGATGGTTATTTTGTCTGAAGAAGAAAGGAAGAAAGCGGCTA
TCATCATGTGGAACATTTGGGCTTATCGCAACAAAATTCTTATGGAAGGTTCAACAGCAGACAAGGATCAATTTCTCAGACCAATTGATGTCAGTATCAAGGAGCATTGC
GCTAGAATTTTTAGGGATTCTGACCTGTTGGATATCAGCTCGAAGAGCCCCACGAGTCACATGTGTTGGGTTCCTCCCTCTAGCGAGCAGTTGAAGCTTAATGTGGATGC
CTCATGGAGCGACGACTGCAAGGCTGGGGGCATAGGATGGATGATCCGTGACTCTCTGGGATCTTTAGTAGCTGCTGGGAACAAAAGTGTTAGTAGAAAGTGGTCAATTC
CCTCGCTTGAAGCTCTAACGATCAAAGAAGGGCTAACGAGTTACCTTAATTTAGGCCTCAATGCCCCACTGGTGGTTGAATCAGACTCTGCATCCGTTGTGAAATCGCTC
AATAATCCTAATTTGGACCTCTTTGAAACCCACTTGATTATGAAGGATATTTTTATCTGGGCTAGGAATTGTGCAGAGATCTCTTTCTCTTGGTGCTGTAGAGAAAGGAA
TGGGGTGGCGAATGCCTTGGCACGGAATGCTGTCCAGTTGATGTTGTCTCGACCACAGCTGGAGAATAAGGAAGATGCAAGAAGTTGCAGCAACTATTTTTCCCGAGAGG
AGAGCGTTCAGACGTTGTGTTGCTATGTTGAGAGCGTCTCGACGCTCTCCAATTTTCCATATCTGATTAAGAGGCAGCAACAGGGCAGCGTCGAGGCGCTGTGGCAGATT
TTTATAAATATGAAACGATTTTCAGTCGTAAAAAGGGAGTTCAATTTGGGAGATCCTCGTCCCCAATATTTTCCACCTCCACAACCATTTCCGCAAAATCCAATGCCAAA
TCTTGTTCAATATCCAAATCCATTTAACCCTAACTCATACCTGACTTTACCTCAGCCCCTATCAATGAAGCTGAATGACACCAACTTCCTCCTCTGGAGAAACCAACTAT
TGAATGCGATGATTGCAAATGGCCTTCAAGGTTATCTCGATGGTTCAATTGCAGCTCCTCCCAGGTATCTCGATGATCAACAACTTCAACCAAATCCAGATTTTCTCCAT
TGGGAAAGGTATAATCGGTTTATCAAGTGTTGGATATACTCCTCTCTATCAGTAGAAAAAATGAGTGAGATAGTTAGTTTTGAGTCTGCTGCTGATATTTGGAATTCTTT
GAAACGTTCCTATGATTCTAAAACTACGACTAGGATTATGGGACTCAAAACTCAACTTCAAAAAATTAAAAGGATGGACAATTTAGCCTATCAAACCCCACCACCTCAAG
CTTTGTTATCTACTACGTTCACTCCCACTCCCTCCTCTGTTCCTGACACTTTATCCACAATGTTCACTGATTCCTATCACCTTGATAAAAATTGGTTTTTAGATTCCGGG
GCCACTCATCACATGACTCATGATGCATCGAACCTTTACAATCTAACACCCTACAATGGTGGTGAGCAAGTCATTGTGGGAAATGGATCTTCAATCCAAGCAAGTTCTCC
TCCAGGGCATCCTTGA
mRNA sequence
ATGGATGGTCAGGGCCATTGGAATGAAGAGTTCATTAGAGGGATGTTTTGGTTGATGGATGCTGAGGATATTCTGGACATTCCTTTGGCCTCCACTCCCTCTAAGGATGA
TATTATTTGGGCCTTGGATAACAAGGGCACGTTCTCGGTGAAATCTACTTATCATCTTGCTAGCAACCTTAGGGAGGGGTTGAAGGAATCCTCGTCGAATGATAATTCCT
CTAGGTCCTTTTGGAAAAAGCTCTGGAATATTAAGATCATTCCTAGAGCTAAATTTTGTGTTTGGAGAGTTATCAATAATGCTCTCCCTACTAAACTTAATCTAATTAAG
ATAGGGATGGTTGTTAATCCGTTTTGTGTGTTTTGCAGGTGCAAAGAGGAATCCTCGTCTCATGTGGTGTGGAAATGTAAGTTCTCTAAGCTATCGTGGATGAATTTTCT
CCCTAACTTGGATAGTTTGTTCTCTATCAACAGGGAGGATTGGAGGCCAATTGACTGGTGGGAAGGAATGATGGTTATTTTGTCTGAAGAAGAAAGGAAGAAAGCGGCTA
TCATCATGTGGAACATTTGGGCTTATCGCAACAAAATTCTTATGGAAGGTTCAACAGCAGACAAGGATCAATTTCTCAGACCAATTGATGTCAGTATCAAGGAGCATTGC
GCTAGAATTTTTAGGGATTCTGACCTGTTGGATATCAGCTCGAAGAGCCCCACGAGTCACATGTGTTGGGTTCCTCCCTCTAGCGAGCAGTTGAAGCTTAATGTGGATGC
CTCATGGAGCGACGACTGCAAGGCTGGGGGCATAGGATGGATGATCCGTGACTCTCTGGGATCTTTAGTAGCTGCTGGGAACAAAAGTGTTAGTAGAAAGTGGTCAATTC
CCTCGCTTGAAGCTCTAACGATCAAAGAAGGGCTAACGAGTTACCTTAATTTAGGCCTCAATGCCCCACTGGTGGTTGAATCAGACTCTGCATCCGTTGTGAAATCGCTC
AATAATCCTAATTTGGACCTCTTTGAAACCCACTTGATTATGAAGGATATTTTTATCTGGGCTAGGAATTGTGCAGAGATCTCTTTCTCTTGGTGCTGTAGAGAAAGGAA
TGGGGTGGCGAATGCCTTGGCACGGAATGCTGTCCAGTTGATGTTGTCTCGACCACAGCTGGAGAATAAGGAAGATGCAAGAAGTTGCAGCAACTATTTTTCCCGAGAGG
AGAGCGTTCAGACGTTGTGTTGCTATGTTGAGAGCGTCTCGACGCTCTCCAATTTTCCATATCTGATTAAGAGGCAGCAACAGGGCAGCGTCGAGGCGCTGTGGCAGATT
TTTATAAATATGAAACGATTTTCAGTCGTAAAAAGGGAGTTCAATTTGGGAGATCCTCGTCCCCAATATTTTCCACCTCCACAACCATTTCCGCAAAATCCAATGCCAAA
TCTTGTTCAATATCCAAATCCATTTAACCCTAACTCATACCTGACTTTACCTCAGCCCCTATCAATGAAGCTGAATGACACCAACTTCCTCCTCTGGAGAAACCAACTAT
TGAATGCGATGATTGCAAATGGCCTTCAAGGTTATCTCGATGGTTCAATTGCAGCTCCTCCCAGGTATCTCGATGATCAACAACTTCAACCAAATCCAGATTTTCTCCAT
TGGGAAAGGTATAATCGGTTTATCAAGTGTTGGATATACTCCTCTCTATCAGTAGAAAAAATGAGTGAGATAGTTAGTTTTGAGTCTGCTGCTGATATTTGGAATTCTTT
GAAACGTTCCTATGATTCTAAAACTACGACTAGGATTATGGGACTCAAAACTCAACTTCAAAAAATTAAAAGGATGGACAATTTAGCCTATCAAACCCCACCACCTCAAG
CTTTGTTATCTACTACGTTCACTCCCACTCCCTCCTCTGTTCCTGACACTTTATCCACAATGTTCACTGATTCCTATCACCTTGATAAAAATTGGTTTTTAGATTCCGGG
GCCACTCATCACATGACTCATGATGCATCGAACCTTTACAATCTAACACCCTACAATGGTGGTGAGCAAGTCATTGTGGGAAATGGATCTTCAATCCAAGCAAGTTCTCC
TCCAGGGCATCCTTGA
Protein sequence
MDGQGHWNEEFIRGMFWLMDAEDILDIPLASTPSKDDIIWALDNKGTFSVKSTYHLASNLREGLKESSSNDNSSRSFWKKLWNIKIIPRAKFCVWRVINNALPTKLNLIK
IGMVVNPFCVFCRCKEESSSHVVWKCKFSKLSWMNFLPNLDSLFSINREDWRPIDWWEGMMVILSEEERKKAAIIMWNIWAYRNKILMEGSTADKDQFLRPIDVSIKEHC
ARIFRDSDLLDISSKSPTSHMCWVPPSSEQLKLNVDASWSDDCKAGGIGWMIRDSLGSLVAAGNKSVSRKWSIPSLEALTIKEGLTSYLNLGLNAPLVVESDSASVVKSL
NNPNLDLFETHLIMKDIFIWARNCAEISFSWCCRERNGVANALARNAVQLMLSRPQLENKEDARSCSNYFSREESVQTLCCYVESVSTLSNFPYLIKRQQQGSVEALWQI
FINMKRFSVVKREFNLGDPRPQYFPPPQPFPQNPMPNLVQYPNPFNPNSYLTLPQPLSMKLNDTNFLLWRNQLLNAMIANGLQGYLDGSIAAPPRYLDDQQLQPNPDFLH
WERYNRFIKCWIYSSLSVEKMSEIVSFESAADIWNSLKRSYDSKTTTRIMGLKTQLQKIKRMDNLAYQTPPPQALLSTTFTPTPSSVPDTLSTMFTDSYHLDKNWFLDSG
ATHHMTHDASNLYNLTPYNGGEQVIVGNGSSIQASSPPGHP
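
The protein sequence is the conceptual translation of the CDS. As an illustration, the sketch below translates the first and last codons of the CDS with the standard genetic code (NCBI translation table 1). The choice of table is an assumption, since the page does not state it, and `translate` is a hypothetical helper.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; '*' marks a stop codon, 'X' an unknown codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


print(translate("ATGGATGGTCAGGGCCAT"))  # first six codons of the CDS
print(translate("GGGCATCCTTGA"))        # last four codons, including the stop
```

These calls print MDGQGH and GHP*, matching the start and end of the protein sequence above (the stop codon is not written in the protein record).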