CuGenDBv2

Clc02G09630 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc02G09630
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: ClcChr02:13785539..13791987
RNA-Seq Expression: Clc02G09630
Synteny: Clc02G09630
Gene Ontology terms: NA
InterPro domains: NA
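
The genome location above is written as chromosome, colon, start, two dots, end. As a minimal sketch, the snippet below splits that string and computes the span of the locus. It assumes the coordinates are 1-based and inclusive, which this record does not state; `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("ClcChr02:13785539..13791987")
# Assumption: 1-based inclusive coordinates, so both end points are counted.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus covers 6449 bp on ClcChr02.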


Homology
GenBank top hits (e value, % identity, alignment)
KAG7547891.1 Ribonuclease H-like superfamily [Arabidopsis suecica]; e value: 2.4e-110; % identity: 42.93
Query:  MRAKLPTFGWGHAILHVASLAHIKSVAYHKYSSLQLVYGQEPNISHPRIFGCAVYVPIALLQRTKMGPQRRLGIYVGYDSPSIIKYLEPLTGDVFTARFA
        MR++LP   WGHA+LH + L  I+  + HKYS  QL+ G EP+ISH +IFGCAVYVPIA   RTKMGPQRR+GIYVG+DSP+IIKYLEP TGD+F AR+A
Subjt:  MRAKLPTFGWGHAILHVASLAHIKSVAYHKYSSLQLVYGQEPNISHPRIFGCAVYVPIALLQRTKMGPQRRLGIYVGYDSPSIIKYLEPLTGDVFTARFA

Query:  DCHFNEINFPTLGGGIKKLEKEIAWNASLLSHLDPRTNQFELEVQKIIHLQNIANQLPDAFIDAKKVTKSHIPITNVPSKIDIPTQ-QVVTNESGTHQKR
        DCHFNE  FPTLGG   KL +EI+WN + L+  DPRT   E EV KII+LQ +AN+LPD+F+D KKVTKS+IP  N P +ID+      V  ES   +KR
Subjt:  DCHFNEINFPTLGGGIKKLEKEIAWNASLLSHLDPRTNQFELEVQKIIHLQNIANQLPDAFIDAKKVTKSHIPITNVPSKIDIPTQ-QVVTNESGTHQKR

Query:  GRPVGSKDKNPRKRK---INDSKKDLVE-------------------------------------------DVNIHE-----------------------
        GRP+GSKDKNP+K K   I +  K+ ++                                           DV+I E                       
Subjt:  GRPVGSKDKNPRKRK---INDSKKDLVE-------------------------------------------DVNIHE-----------------------

Query:  ----------------------------------------------------------------KTLDMIDNPISEEDNNEELLGPKVPYLSAIGA-MYL
                                                                        ++L +  +P   + + EE+LGP+VPYLSAIGA MYL
Subjt:  ----------------------------------------------------------------KTLDMIDNPISEEDNNEELLGPKVPYLSAIGA-MYL

Query:  ANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRGTIDV-------------------------------GYLFTCGGTAISWRLVKQTITATSSN
        A++TRPDI F+VNLL+R++S PT+RHWNGIKH+L YL+GT+D+                               GY+FT GGTAISWR +KQTI+ATSSN
Subjt:  ANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRGTIDV-------------------------------GYLFTCGGTAISWRLVKQTITATSSN

Query:  HAEILAIHEASRKCVWLRSMTQHIGETFDLFSSKISPTTLYEDNTAKHWNAATQRSQHLTGSHTRH
        HAEILAIHEASR+CVWLRSMTQHI     +   K  PT +YEDN A     A  +  ++ G  T+H
Subjt:  HAEILAIHEASRKCVWLRSMTQHIGETFDLFSSKISPTTLYEDNTAKHWNAATQRSQHLTGSHTRH

KAG7588838.1 Ribonuclease H-like superfamily [Arabidopsis suecica]; e value: 1.6e-130; % identity: 51.25
Query:  MRAKLPTFGWGHAILHVASLAHIKSVAYHKYSSLQLVYGQEPNISHPRIFGCAVYVPIALLQRTKMGPQRRLGIYVGYDSPSIIKYLEPLTGDVFTARFA
        +R KLP   WGHAILH A+L  I+  AY+KYS LQL  GQEP+++H R+FGCAVYVPIA  QRTKMGPQRRLGIYVGY SPSII+YLEP TGD+FTARFA
Subjt:  MRAKLPTFGWGHAILHVASLAHIKSVAYHKYSSLQLVYGQEPNISHPRIFGCAVYVPIALLQRTKMGPQRRLGIYVGYDSPSIIKYLEPLTGDVFTARFA

Query:  DCHFNEINFPTLGGGIKKLEKEIAWNASLLSHLDPRTNQFELEVQKIIHLQNIANQLPDAFIDAKKVTKSHIPITNVPSKIDIPTQQV---VTNESGTHQ
        DCHFNE  FP LGGGIK++ K+I W+   L +LDP TNQ ELEVQKIIHLQN+ANQLPDAF D K+VTKS+IP  NVPS+++IP ++     TNE     
Subjt:  DCHFNEINFPTLGGGIKKLEKEIAWNASLLSHLDPRTNQFELEVQKIIHLQNIANQLPDAFIDAKKVTKSHIPITNVPSKIDIPTQQV---VTNESGTHQ

Query:  KRGRPVGSKDKNPR-KRKINDSKKDLVEDVN---------------------------------------------IHE------KTLDMIDNPISEEDN
        K GRP+GSK+KNPR K+KI D  + + E+ N                                              HE      ++L++ ++P+   ++
Subjt:  KRGRPVGSKDKNPR-KRKINDSKKDLVEDVN---------------------------------------------IHE------KTLDMIDNPISEEDN

Query:  NEELLGPKVPYLSAIGA-MYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRGTIDV-------------------------------GYLFT
        NEE+LGP+VPY+SAIG  MYLAN TRPDIAFS NLLARYNSSPT RHWNGIKHI  YL+GTID+                               GY+FT
Subjt:  NEELLGPKVPYLSAIGA-MYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRGTIDV-------------------------------GYLFT

Query:  CGGTAISWRLVKQTITATSSNHAEILAIHEASRKCVWLRSMTQHIGETFDLFSSKISPTTLYEDNTA-----KHWNAATQRSQHLTG---SHTRHHSQPR
         GGTAISWR  KQT+ ATSSNHAE++A+HEA R+C+WLRSMT HI E   + SSK  PTTL+EDN A     K     + R++H+     S+T+   + +
Subjt:  CGGTAISWRLVKQTITATSSNHAEILAIHEASRKCVWLRSMTQHIGETFDLFSSKISPTTLYEDNTA-----KHWNAATQRSQHLTG---SHTRHHSQPR

Query:  GADVVVVSHVDPAFDVRTSPL
          D+  +   D A D+ T  L
Subjt:  GADVVVVSHVDPAFDVRTSPL

KAG7594482.1 Integrase catalytic core [Arabidopsis thaliana x Arabidopsis arenosa]; e value: 2.3e-113; % identity: 48.76
Query:  MRAKLPTFGWGHAILHVASLAHIKSVAYHKYSSLQLVYGQEPNISHPRIFGCAVYVPIALLQRTKMGPQRRLGIYVGYDSPSIIKYLEPLTGDVFTARFA
        MR++LP   WGHA+LH + L  I+  + HKYS  +L+ G EP+ISH +IFGCAVYVPIAL  RTKMGPQRR+GIYVG+DSP+IIKYLEP TGD+F AR+A
Subjt:  MRAKLPTFGWGHAILHVASLAHIKSVAYHKYSSLQLVYGQEPNISHPRIFGCAVYVPIALLQRTKMGPQRRLGIYVGYDSPSIIKYLEPLTGDVFTARFA

Query:  DCHFNEINFPTLGGGIKKLEKEIAWNASLLSHLDPRTNQFELEVQKIIHLQNIANQLPDAFIDAKKVTKSHIPITNVPSKIDIPTQ-QVVTNESGTHQKR
        DCHFNE  FPTLGG   KL +EI+WN + L+  DPRT   E EV KII+LQ +AN+LPD+F+D KKVTKS+IP  N P +ID+  +      ES   +KR
Subjt:  DCHFNEINFPTLGGGIKKLEKEIAWNASLLSHLDPRTNQFELEVQKIIHLQNIANQLPDAFIDAKKVTKSHIPITNVPSKIDIPTQ-QVVTNESGTHQKR

Query:  GRPV------------GSKDKNPRKRKINDSKKDL----------VEDVN----IHEK-------------------------TLDMIDNPISEEDNNEE
        GRP+            G   +     K     KDL          +E VN    +H+K                         +L +  +P   + + EE
Subjt:  GRPV------------GSKDKNPRKRKINDSKKDL----------VEDVN----IHEK-------------------------TLDMIDNPISEEDNNEE

Query:  LLGPKVPYLSAIGA-MYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRGTIDV-------------------------------GYLFTCGG
        +LGP+VPYLSAIGA MYLA++TRPDI F+VNLL+R++S PT+RHWNGIKH+L YL+GT+D+                               GY+FT GG
Subjt:  LLGPKVPYLSAIGA-MYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRGTIDV-------------------------------GYLFTCGG

Query:  TAISWRLVKQTITATSSNHAEILAIHEASRKCVWLRSMTQHIGETFDLFSSKISPTTLYEDNTAKHWNAATQRSQHLTGSHTRH
        TAISWR +KQTI+ATSSNHAEILAIHEASR+CVWLRSMTQH+     +   K  PT +YEDN A     A  +  ++ G  T+H
Subjt:  TAISWRLVKQTITATSSNHAEILAIHEASRKCVWLRSMTQHIGETFDLFSSKISPTTLYEDNTAKHWNAATQRSQHLTGSHTRH

RVW59189.1 Copia protein [Vitis vinifera]; e value: 5.0e-108; % identity: 37.45
Query:  MRAKLPTFGWGHAILHVASLAHIKSVAYHKYSSLQLVYGQEPNISHPRIFGCAVYVPIALLQRTKMGPQRRLGIYVGYDSPSIIKYLEPLTGDVFTARFA
        M+ KLPT  WGHAI+H A+L  I+   YH+YS  QLV G++PNISH RIFGCAVYVPIA  QRTKMGPQRRLG+YVG+DSPSII+YLEPLTGDVFTARFA
Subjt:  MRAKLPTFGWGHAILHVASLAHIKSVAYHKYSSLQLVYGQEPNISHPRIFGCAVYVPIALLQRTKMGPQRRLGIYVGYDSPSIIKYLEPLTGDVFTARFA

Query:  DCHFNEINFPTLG--GGIKKLEKEIAWNASLLSHLDPRTNQFELEVQKIIHLQNIANQLPDAFIDAKKVTKSHIPITNVPSKIDIPTQQVVTNESGTHQK
        DCHFNE  FP+LG    I +  +EI+W  S ++HLDPRTNQ ELEVQ+IIHLQN+ANQLPDAFID KKVTKSHIP  N P++ID+P  Q +TNES    K
Subjt:  DCHFNEINFPTLG--GGIKKLEKEIAWNASLLSHLDPRTNQFELEVQKIIHLQNIANQLPDAFIDAKKVTKSHIPITNVPSKIDIPTQQVVTNESGTHQK

Query:  RGRPVGSKDKNPRKRK------------------------------------------------------------------------------------
        RGRPVGSKD  PRKR+                                                                                    
Subjt:  RGRPVGSKDKNPRKRK------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------INDSKKDLVEDVN-------------------------------------------IHE
                                                  N++K   V+D+N                                           +H+
Subjt:  -----------------------------------------INDSKKDLVEDVN-------------------------------------------IHE

Query:  KT-------------------------LDMIDNPISEEDNNEELLGPKVPYLSAIGA-MYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRG
         T                         LD+  +P    + +EELLGP+VPYLSAIGA MYLAN TRPDIAFSVNLLARY+S+PT+RHWNGIKHIL YLRG
Subjt:  KT-------------------------LDMIDNPISEEDNNEELLGPKVPYLSAIGA-MYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRG

Query:  TIDV-------------------------------GYLFTCGGTAISWRLVKQTITATSSNHAEILAIHEASRKCVWLRSMTQHIGETFDLFSSKISPTT
        T D+                               GY+F C GTAISWR VKQT+ ATSSNH+EILAIHEASR+C+WLRSM QHI E+  L S K  PTT
Subjt:  TIDV-------------------------------GYLFTCGGTAISWRLVKQTITATSSNHAEILAIHEASRKCVWLRSMTQHIGETFDLFSSKISPTT

Query:  LYEDNTAKHWNAATQRSQHLTGSHTRHHS
        L+EDN A     A     ++ G  T+H S
Subjt:  LYEDNTAKHWNAATQRSQHLTGSHTRHHS

RVW80615.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]; e value: 3.9e-108; % identity: 39.45
Query:  MRAKLPTFGWGHAILHVASLAHIKSVAYHKYSSLQLVYGQEPNISHPRIFGCAVYVPIALLQRTKMGPQRRLGIYVGYDSPSIIKYLEPLTGDVFTARFA
        M+ KLPT  WGHAI+H A+L  I+   YH+YS  QLV G++PNISH RIFGCAVYVPIA  QRTKMGPQRRLG+YVG+DSPSII+YLEPLTGDVFTARFA
Subjt:  MRAKLPTFGWGHAILHVASLAHIKSVAYHKYSSLQLVYGQEPNISHPRIFGCAVYVPIALLQRTKMGPQRRLGIYVGYDSPSIIKYLEPLTGDVFTARFA

Query:  DCHFNEINFPTLG--GGIKKLEKEIAWNASLLSHLDPRTNQFELEVQKIIHLQNIANQLPDAFIDAKKVTKSHIPITNVPSKIDIPTQQVVTNESGTHQK
        DCHFNE  FP+LG    I +  +EI+W  S ++HLDPRTNQ ELEVQ+IIHLQN+ANQLPDAFID KKVTKSHIP  N P++ID+P  Q +TNE     K
Subjt:  DCHFNEINFPTLG--GGIKKLEKEIAWNASLLSHLDPRTNQFELEVQKIIHLQNIANQLPDAFIDAKKVTKSHIPITNVPSKIDIPTQQVVTNESGTHQK

Query:  RGRPVGSKDKNPRKRKIND---------------------------------------------------------------------------------
         GRPVGSKD  PRKR+  +                                                                                 
Subjt:  RGRPVGSKDKNPRKRKIND---------------------------------------------------------------------------------

Query:  ----------------------------------------------------------SKKDL-------------------------------------
                                                                  +K+++                                     
Subjt:  ----------------------------------------------------------SKKDL-------------------------------------

Query:  -----VEDVN-------------------------------------------IHEKT-------------------------LDMIDNPISEEDNNEEL
             V+D+N                                           +H+ T                         LD+  +P    + +EEL
Subjt:  -----VEDVN-------------------------------------------IHEKT-------------------------LDMIDNPISEEDNNEEL

Query:  LGPKVPYLSAIGA-MYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRGTIDVGYLFTCGGTAISWRLVKQTITATSSNHAEILAIHEASRKC
        LGP+VPYLSA+GA MYLAN TRPDIAFSVNLLARY+S+PT+RHWN   ++    +G    GY+F C GTAISWR VKQT+ ATSSNH+EILAIHEASR+C
Subjt:  LGPKVPYLSAIGA-MYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRGTIDVGYLFTCGGTAISWRLVKQTITATSSNHAEILAIHEASRKC

Query:  VWLRSMTQHIGETFDLFSSKISPTTLYEDNTAKHWNAATQRSQHLTGSHTRHHS
        +WLRSM QHI E+  L S K  PTTL+EDN A     A     ++ G  T+H S
Subjt:  VWLRSMTQHIGETFDLFSSKISPTTLYEDNTAKHWNAATQRSQHLTGSHTRHHS

TrEMBL top hits (e value, % identity, alignment)
A0A151UBQ7 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment); e value: 4.2e-108; % identity: 38.61
Query:  MRAKLPTFGWGHAILHVASLAHIKSVAYHKYSSLQLVYGQEPNISHPRIFGCAVYVPIALLQRTKMGPQRRLGIYVGYDSPSIIKYLEPLTGDVFTARFA
        MR+KLP   WGHAILH A L  I+  +YHK+S LQLV+G++PNISH RIFGCAVYVPIA  QRTKMGPQRRLGIYVGY+SPSIIKYLEPLTGD+FTARFA
Subjt:  MRAKLPTFGWGHAILHVASLAHIKSVAYHKYSSLQLVYGQEPNISHPRIFGCAVYVPIALLQRTKMGPQRRLGIYVGYDSPSIIKYLEPLTGDVFTARFA

Query:  DCHFNEINFPTLGGGIKKLEKEIAWNASLLSHLDPRTNQFELEVQKIIHLQNIANQLPDAFIDAKKVTKSHIPITNVPSKIDIPTQQV-VTNESGTHQKR
        DCHFNE+ FPTLGG  K+LEK+I+WN   LSHLDPRT Q ELEVQKIIHLQN+ANQLP+ F D K+VTKS++P  N P ++++P  QV  TNES    KR
Subjt:  DCHFNEINFPTLGGGIKKLEKEIAWNASLLSHLDPRTNQFELEVQKIIHLQNIANQLPDAFIDAKKVTKSHIPITNVPSKIDIPTQQV-VTNESGTHQKR

Query:  G------------------------------------------------------RPVGSKDKNPRKRKINDS----KKDLV------------------
                                                               +P+G K    RKR  N      K  LV                  
Subjt:  G------------------------------------------------------RPVGSKDKNPRKRKINDS----KKDLV------------------

Query:  ----------------EDVNIH------------------------------------------------------------------------------
                        E++N+H                                                                              
Subjt:  ----------------EDVNIH------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------EKTLDMIDNPISEEDNNEELLGPKVPYLSAIGA
                                                                            ++LD+  +P   ++ +EE+LGP+VPYLSAIG 
Subjt:  -------------------------------------------------------------------EKTLDMIDNPISEEDNNEELLGPKVPYLSAIGA

Query:  -MYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRGTIDV-------------------------------GYLFTCGGTAISWRLVKQTITA
         MYLAN TRPDI F+VNLLARY+SSPT+RHWNG+K IL YLRGT+D+                               GYLFT GGT ISWR VKQTI+ 
Subjt:  -MYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRGTIDV-------------------------------GYLFTCGGTAISWRLVKQTITA

Query:  TSSNHAEILAIHEASRKCVWLRSMTQHIGETFDLFSSKISPTTLYEDNTAKHWNAATQRSQHLTGSHTRH--------HSQPRGADVVV
        TSSNHAEILA+HEASR CVWLR + QHI ET  L S K++ T +YEDN A     A  + +++ G  T+H        H   R  D+ V
Subjt:  TSSNHAEILAIHEASRKCVWLRSMTQHIGETFDLFSSKISPTTLYEDNTAKHWNAATQRSQHLTGSHTRH--------HSQPRGADVVV

A0A438FGR3 Copia protein; e value: 2.4e-108; % identity: 37.45
Query:  MRAKLPTFGWGHAILHVASLAHIKSVAYHKYSSLQLVYGQEPNISHPRIFGCAVYVPIALLQRTKMGPQRRLGIYVGYDSPSIIKYLEPLTGDVFTARFA
        M+ KLPT  WGHAI+H A+L  I+   YH+YS  QLV G++PNISH RIFGCAVYVPIA  QRTKMGPQRRLG+YVG+DSPSII+YLEPLTGDVFTARFA
Subjt:  MRAKLPTFGWGHAILHVASLAHIKSVAYHKYSSLQLVYGQEPNISHPRIFGCAVYVPIALLQRTKMGPQRRLGIYVGYDSPSIIKYLEPLTGDVFTARFA

Query:  DCHFNEINFPTLG--GGIKKLEKEIAWNASLLSHLDPRTNQFELEVQKIIHLQNIANQLPDAFIDAKKVTKSHIPITNVPSKIDIPTQQVVTNESGTHQK
        DCHFNE  FP+LG    I +  +EI+W  S ++HLDPRTNQ ELEVQ+IIHLQN+ANQLPDAFID KKVTKSHIP  N P++ID+P  Q +TNES    K
Subjt:  DCHFNEINFPTLG--GGIKKLEKEIAWNASLLSHLDPRTNQFELEVQKIIHLQNIANQLPDAFIDAKKVTKSHIPITNVPSKIDIPTQQVVTNESGTHQK

Query:  RGRPVGSKDKNPRKRK------------------------------------------------------------------------------------
        RGRPVGSKD  PRKR+                                                                                    
Subjt:  RGRPVGSKDKNPRKRK------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------INDSKKDLVEDVN-------------------------------------------IHE
                                                  N++K   V+D+N                                           +H+
Subjt:  -----------------------------------------INDSKKDLVEDVN-------------------------------------------IHE

Query:  KT-------------------------LDMIDNPISEEDNNEELLGPKVPYLSAIGA-MYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRG
         T                         LD+  +P    + +EELLGP+VPYLSAIGA MYLAN TRPDIAFSVNLLARY+S+PT+RHWNGIKHIL YLRG
Subjt:  KT-------------------------LDMIDNPISEEDNNEELLGPKVPYLSAIGA-MYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRG

Query:  TIDV-------------------------------GYLFTCGGTAISWRLVKQTITATSSNHAEILAIHEASRKCVWLRSMTQHIGETFDLFSSKISPTT
        T D+                               GY+F C GTAISWR VKQT+ ATSSNH+EILAIHEASR+C+WLRSM QHI E+  L S K  PTT
Subjt:  TIDV-------------------------------GYLFTCGGTAISWRLVKQTITATSSNHAEILAIHEASRKCVWLRSMTQHIGETFDLFSSKISPTT

Query:  LYEDNTAKHWNAATQRSQHLTGSHTRHHS
        L+EDN A     A     ++ G  T+H S
Subjt:  LYEDNTAKHWNAATQRSQHLTGSHTRHHS

A0A438H801 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 1.9e-108; % identity: 39.45
Query:  MRAKLPTFGWGHAILHVASLAHIKSVAYHKYSSLQLVYGQEPNISHPRIFGCAVYVPIALLQRTKMGPQRRLGIYVGYDSPSIIKYLEPLTGDVFTARFA
        M+ KLPT  WGHAI+H A+L  I+   YH+YS  QLV G++PNISH RIFGCAVYVPIA  QRTKMGPQRRLG+YVG+DSPSII+YLEPLTGDVFTARFA
Subjt:  MRAKLPTFGWGHAILHVASLAHIKSVAYHKYSSLQLVYGQEPNISHPRIFGCAVYVPIALLQRTKMGPQRRLGIYVGYDSPSIIKYLEPLTGDVFTARFA

Query:  DCHFNEINFPTLG--GGIKKLEKEIAWNASLLSHLDPRTNQFELEVQKIIHLQNIANQLPDAFIDAKKVTKSHIPITNVPSKIDIPTQQVVTNESGTHQK
        DCHFNE  FP+LG    I +  +EI+W  S ++HLDPRTNQ ELEVQ+IIHLQN+ANQLPDAFID KKVTKSHIP  N P++ID+P  Q +TNE     K
Subjt:  DCHFNEINFPTLG--GGIKKLEKEIAWNASLLSHLDPRTNQFELEVQKIIHLQNIANQLPDAFIDAKKVTKSHIPITNVPSKIDIPTQQVVTNESGTHQK

Query:  RGRPVGSKDKNPRKRKIND---------------------------------------------------------------------------------
         GRPVGSKD  PRKR+  +                                                                                 
Subjt:  RGRPVGSKDKNPRKRKIND---------------------------------------------------------------------------------

Query:  ----------------------------------------------------------SKKDL-------------------------------------
                                                                  +K+++                                     
Subjt:  ----------------------------------------------------------SKKDL-------------------------------------

Query:  -----VEDVN-------------------------------------------IHEKT-------------------------LDMIDNPISEEDNNEEL
             V+D+N                                           +H+ T                         LD+  +P    + +EEL
Subjt:  -----VEDVN-------------------------------------------IHEKT-------------------------LDMIDNPISEEDNNEEL

Query:  LGPKVPYLSAIGA-MYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRGTIDVGYLFTCGGTAISWRLVKQTITATSSNHAEILAIHEASRKC
        LGP+VPYLSA+GA MYLAN TRPDIAFSVNLLARY+S+PT+RHWN   ++    +G    GY+F C GTAISWR VKQT+ ATSSNH+EILAIHEASR+C
Subjt:  LGPKVPYLSAIGA-MYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRGTIDVGYLFTCGGTAISWRLVKQTITATSSNHAEILAIHEASRKC

Query:  VWLRSMTQHIGETFDLFSSKISPTTLYEDNTAKHWNAATQRSQHLTGSHTRHHS
        +WLRSM QHI E+  L S K  PTTL+EDN A     A     ++ G  T+H S
Subjt:  VWLRSMTQHIGETFDLFSSKISPTTLYEDNTAKHWNAATQRSQHLTGSHTRHHS

A0A438HAQ6 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 8.1e-104; % identity: 36.95
Query:  MRAKLPTFGWGHAILHVASLAHIKSVAYHKYSSLQLVYGQEPNISHPRIFGCAVYVPIALLQRTKMGPQRRLGIYVGYDSPSIIKYLEPLTGDVFTARFA
        M+ KLPT  WGHAI+H A+L  I+   YH+YS  QLV G++PNISH RIFGCAVYVPIA  QRTKMGPQRRLG+YVG+DSPSII+YLEPLTGDVFTARFA
Subjt:  MRAKLPTFGWGHAILHVASLAHIKSVAYHKYSSLQLVYGQEPNISHPRIFGCAVYVPIALLQRTKMGPQRRLGIYVGYDSPSIIKYLEPLTGDVFTARFA

Query:  DCHFNEINFPTLG--GGIKKLEKEIAWNASLLSHLDPRTNQFELEVQKIIHLQNIANQLPDAFIDAKKVTKSHIPITNVPSKIDIPTQQVVTNESGTHQK
        DCHFNE  FP LG    I +  +EI+W AS ++HLDPRTNQ ELEVQ+IIHLQN+ANQL DAFID KKVTKSHIP  N P++ID+P +Q +TNES    K
Subjt:  DCHFNEINFPTLG--GGIKKLEKEIAWNASLLSHLDPRTNQFELEVQKIIHLQNIANQLPDAFIDAKKVTKSHIPITNVPSKIDIPTQQVVTNESGTHQK

Query:  RGRPVGSKDKNPRKRKIND-----------------SKKDLVEDVNIHEKTLD-----------------------------------------------
        RGRPVGS D  PRKR+  +                  K   +E+  I +KTL+                                               
Subjt:  RGRPVGSKDKNPRKRKIND-----------------SKKDLVEDVNIHEKTLD-----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------MIDNPI----------------------------SEE---
                                                                     ++NPI                             EE   
Subjt:  ------------------------------------------------------------MIDNPI----------------------------SEE---

Query:  -------------------------------------DNNEELLGPKVPYLSAIGA-MYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRGT
                                             + +EELLGP+VPYLSAIGA MYLAN TRP+IAFSVNLLARY+S+PT+RHWNGIK+IL YLRGT
Subjt:  -------------------------------------DNNEELLGPKVPYLSAIGA-MYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRGT

Query:  IDV-------------------------------GYLFTCGGTAISWRLVKQTITATSSNHAEILAIHEASRKCVWLRSMTQHIGETFDLFSSKISPTTL
         D+                               GY+F C GTAISWR VKQT+ ATSSNH+EI AIHEASR+C+WLRSM QHI E+  L S K  P TL
Subjt:  IDV-------------------------------GYLFTCGGTAISWRLVKQTITATSSNHAEILAIHEASRKCVWLRSMTQHIGETFDLFSSKISPTTL

Query:  YEDNTAKHWNAATQRSQHLTGSHTRHHS
        +EDN A     A     ++ G  T+H S
Subjt:  YEDNTAKHWNAATQRSQHLTGSHTRHHS

A5BJH7 Uncharacterized protein; e value: 9.3e-108; % identity: 45.17
Query:  MRAKLPTFGWGHAILHVASLAHIKSVAYHKYSSLQLVYGQEPNISHPRIFGCAVYVPIALLQRTKMGPQRRLGIYVGYDSPSIIKYLEPLTGDVFTARFA
        M+ KLPT  WGHAI+H  +L  I+ + YH+YS  QLV G++PNISH RIFGCAVYVPI    RTKM          G+DSPSII YLEP T DVFTARFA
Subjt:  MRAKLPTFGWGHAILHVASLAHIKSVAYHKYSSLQLVYGQEPNISHPRIFGCAVYVPIALLQRTKMGPQRRLGIYVGYDSPSIIKYLEPLTGDVFTARFA

Query:  DCHFNEINFPTLG--GGIKKLEKEIAWNASLLSHLDPRTNQFELEVQKIIHLQNIANQLPDAFIDAKKVTKSHIPITNVPSKIDIPTQQVVTNESGTHQK
        D HFNE  FP+LG    I +  +EI+W  S ++HLDPRTNQ ELEVQ+I+HLQN+AN+LPDAFID KKVTKSHIP  N P+ ID+     +TNES    K
Subjt:  DCHFNEINFPTLG--GGIKKLEKEIAWNASLLSHLDPRTNQFELEVQKIIHLQNIANQLPDAFIDAKKVTKSHIPITNVPSKIDIPTQQVVTNESGTHQK

Query:  RGRPVGSKDKNPRKRKINDSKKD---------------------------------------------LVEDVN--------------------------
         GRPVGSKDK P +  I     +                                              V+D+N                          
Subjt:  RGRPVGSKDKNPRKRKINDSKKD---------------------------------------------LVEDVN--------------------------

Query:  -----------------IHEKT-------------------------LDMIDNPISEEDNNEELLGPKVPYLSAIGA-MYLANNTRPDIAFSVNLLARYN
                         +H+ T                         LD+  +P    +N+EELLGPKVPYLSAIGA MYLAN TRPDIAFSVNLLARYN
Subjt:  -----------------IHEKT-------------------------LDMIDNPISEEDNNEELLGPKVPYLSAIGA-MYLANNTRPDIAFSVNLLARYN

Query:  SSPTKRHWNGIKHILCYLRGTIDV-------------------------------GYLFTCGGTAISWRLVKQTITATSSNHAEILAIHEASRKCVWLRS
        S PT+RHWNGIKHIL YLRGT D+                               GY+F C GTAISWR VKQT+  TSSNH+EILAIHEAS +C+  RS
Subjt:  SSPTKRHWNGIKHILCYLRGTIDV-------------------------------GYLFTCGGTAISWRLVKQTITATSSNHAEILAIHEASRKCVWLRS

Query:  MTQHIGETFDLFSSKISPTTLYEDNTAKHWNAATQRSQHLTGSHTRHHS
        M QHI E+  L S K  PT L+EDN A     A     ++ G  T+H S
Subjt:  MTQHIGETFDLFSSKISPTTLYEDNTAKHWNAATQRSQHLTGSHTRHHS

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein; e value: 6.0e-11; % identity: 29.28
Query:  IDNPISEEDNNEELLGPK---VPYLSAIGA-MYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRGTIDV-----------------------
        +  P+  + N E L   +    P  S IG  MY+   TRPD+  +VN+L+RY+S      W  +K +L YL+GTID+                       
Subjt:  IDNPISEEDNNEELLGPK---VPYLSAIGA-MYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRGTIDV-----------------------

Query:  ----------GYLFTC-GGTAISWRLVKQTITATSSNHAEILAIHEASRKCVWLRSMTQHIGETFDLFSSKISPTTLYEDN
                  GYLF       I W   +Q   A SS  AE +A+ EA R+ +WL+ +   I    +      +P  +YEDN
Subjt:  ----------GYLFTC-GGTAISWRLVKQTITATSSNHAEILAIHEASRKCVWLRSMTQHIGETFDLFSSKISPTTLYEDN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 1.4e-15; % identity: 36.23
Query:  KVPYLSAIGA-MYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRGTI------------------------------DVGYLFTCGGTAISW
        KVPY SA+G+ MY    TRPDIA +V +++R+  +P K HW  +K IL YLRGT                                 GYLFT  G AISW
Subjt:  KVPYLSAIGA-MYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRGTI------------------------------DVGYLFTCGGTAISW

Query:  RLVKQTITATSSNHAEILAIHEASRKCVWLRSMTQHIG
        +   Q   A S+  AE +A  E  ++ +WL+   Q +G
Subjt:  RLVKQTITATSSNHAEILAIHEASRKCVWLRSMTQHIG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value: 2.3e-10; % identity: 27.19
Query:  GPKVP----YLSAIGAMYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRGTID-------------------------------VGYLFTCG
        G K+P    Y   +G++     TRPD++++VN L++Y   PT  HWN +K +L YL GT D                                GY+   G
Subjt:  GPKVP----YLSAIGAMYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRGTID-------------------------------VGYLFTCG

Query:  GTAISWRLVKQTITATSSNHAEILAIHEASRKCVWLRSMTQHIGETFDLFSSKISPTTLYEDNTAKHWNAAT----QRSQHLTGSH--TRHHSQPRGADV
           ISW   KQ     SS  AE  ++   S +  W+ S+   +G           P  +Y DN    +  A      R +H+   +   R+  Q     V
Subjt:  GTAISWRLVKQTITATSSNHAEILAIHEASRKCVWLRSMTQHIGETFDLFSSKISPTTLYEDNTAKHWNAAT----QRSQHLTGSH--TRHHSQPRGADV

Query:  VVVSHVDPAFDVRTSPL
        V VS  D   D  T PL
Subjt:  VVVSHVDPAFDVRTSPL

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8; e value: 6.3e-08; % identity: 26
Query:  YLSAIGAMYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRGTIDV-------------------------------GYLFTCGGTAISWRLV
        Y   IG +     TR DI+F+VN L++++ +P   H   +  IL Y++GT+                                 GY    G + ISW+  
Subjt:  YLSAIGAMYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRGTIDV-------------------------------GYLFTCGGTAISWRLV

Query:  KQTITATSSNHAEILAIHEASRKCVWLRSMTQHIGETFDLFSSKISPTTLYEDNTAKHWNAAT----QRSQHLTGSHTRHHSQPRGADVVVVSHVDPAFD
        KQ + + SS  AE  A+  A+ + +WL    + +     L  SK  PT L+ DNTA    A      +R++H+      H  + R      +S+   A+D
Subjt:  KQTITATSSNHAEILAIHEASRKCVWLRSMTQHIGETFDLFSSKISPTTLYEDNTAKHWNAAT----QRSQHLTGSHTRHHSQPRGADVVVVSHVDPAFD

ATMG00810.1 DNA/RNA polymerases superfamily protein; e value: 4.2e-04; % identity: 23.02
Query:  YLSAIGAMYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRGTI-------------------------------DVGYLFTCGGTAISWRLV
        + S +GA+     TRPDI+++VN++ +    PT   ++ +K +L Y++GTI                                 G+    G   ISW   
Subjt:  YLSAIGAMYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRGTI-------------------------------DVGYLFTCGGTAISWRLV

Query:  KQTITATSSNHAEILAIHEASRKCVW
        +Q   + SS   E  A+   + +  W
Subjt:  KQTITATSSNHAEILAIHEASRKCVW
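
In the alignment blocks above, the unlabelled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, the snippet below counts identities and positives for a single block, using the last block of the ATMG00810.1 hit as input. `midline_stats` is a hypothetical helper, and the counts are per block, so they will not by themselves reproduce the % identity reported for the whole hit.

```python
def midline_stats(midline: str) -> tuple[int, int]:
    """Return (identities, positives) for one BLAST-style midline string."""
    # Letters in the midline mark identical residues.
    identities = sum(ch.isalpha() for ch in midline)
    # '+' marks conservative substitutions; positives include identities.
    positives = identities + midline.count("+")
    return identities, positives


query = "KQTITATSSNHAEILAIHEASRKCVW"
midline = "+Q   + SS   E  A+   + +  W"  # aligned column-for-column with query

identities, positives = midline_stats(midline)
print(f"{identities}/{len(query)} identities, {positives}/{len(query)} positives")
```

Summing identities and aligned columns over every block of a hit, then dividing, would give a whole-hit figure comparable to the % identity column, assuming the database computes it the same way.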


Sequences
CDS sequence
ATGAGAGCGAAGCTTCCTACATTTGGATGGGGACATGCTATTTTGCATGTTGCATCACTTGCACATATTAAGTCAGTAGCTTATCATAAATACTCATCATTACAATTAGT
TTATGGCCAAGAGCCAAATATTTCTCATCCAAGAATTTTTGGATGTGCAGTATATGTTCCAATTGCTCTACTACAACGTACTAAGATGGGTCCTCAAAGGAGGTTAGGAA
TATACGTTGGATATGATTCCCCATCAATTATTAAATATCTTGAACCCCTGACGGGTGATGTATTTACTGCACGATTTGCTGATTGTCATTTTAATGAGATTAATTTTCCA
ACATTAGGGGGAGGAATTAAGAAGTTGGAAAAGGAAATTGCATGGAATGCATCATTATTGTCTCATTTAGATCCCCGTACAAATCAATTTGAACTTGAAGTTCAGAAAAT
AATTCATTTGCAAAATATAGCAAATCAATTACCAGATGCGTTTATAGATGCAAAGAAAGTAACTAAATCACATATACCCATTACAAATGTTCCGTCGAAAATTGATATCC
CAACACAGCAAGTTGTCACTAATGAATCTGGAACACACCAGAAGCGTGGTAGACCAGTGGGTTCCAAAGATAAAAATCCTCGGAAAAGAAAAATAAATGATAGTAAAAAA
GACCTTGTTGAGGATGTAAATATCCACGAAAAAACCCTAGACATGATTGATAATCCAATTAGTGAAGAAGATAATAATGAAGAACTTCTTGGTCCTAAAGTACCATATCT
TAGTGCAATTGGTGCTATGTATCTTGCTAATAATACAAGACCAGATATTGCATTTTCAGTAAATTTATTAGCTAGATACAATTCTTCTCCAACAAAAAGACATTGGAATG
GAATTAAGCACATACTCTGTTATCTCCGAGGAACAATCGATGTGGGTTATCTGTTCACATGTGGAGGAACTGCTATATCCTGGCGATTAGTGAAACAAACCATAACAGCA
ACTTCCTCAAATCATGCTGAAATTCTTGCAATCCATGAAGCTAGTAGAAAATGTGTATGGCTAAGATCAATGACTCAACACATTGGTGAAACATTTGATTTGTTTTCTAG
TAAAATTTCTCCAACGACATTATACGAAGACAACACAGCAAAACATTGGAATGCGGCGACTCAAAGATCTCAACATCTCACCGGGTCTCACACACGCCACCATTCCCAAC
CACGTGGAGCCGACGTCGTAGTCGTCAGTCATGTAGATCCGGCATTCGACGTTCGCACAAGCCCACTTGTCGAGTGTTCAGAGTTTTGTACCATTCCATTCGTCATCCGT
CGTTCCTACGCCCTCACGATGCCGTTGGGGGACGCTGTTGTTGCCGAATTTTGTTTTAGAAATGAAGGGCGTCAAGATGCCACTGGCAACATCTTGCTTCATAGAGGCTT
GAGATGGTTGCTATCTAATACATATATTCCATGGGGTGGAGATTCTCACTCACTGGGAGACGACTCTGAAGATGCTCAGAATCCTTCTCTCAGAAGAGCTCGAAATGGAG
GCTGGTTATGGCTCGACGCATGGGTTTCCACTTGTTCTCGCAGCAGCTTCGTCTCTTCGTACTTGGTTGTTTGCGCTTCTTTTCTTCTTAACCTTGCGTTAGGAACAAAA
AAGCTCATTGTTTTTCAGGATGAAGGTAAACGCAATAGGACAACGCATAGAGGAAGAACTGGAGAAGGATTTATTCGTTATGAATATGAGTGTTGGACGCATGCAGTCGC
CGCGTTGGCGGGGGGAAGGAACGTTCTTAGCTTACCATGGGTTGGCGTTAAGGTAGGTGGCGTTATGGGTGCCGTTGAAGTAAGAAATGAGAACCCAGTGCGCCAGCCTG
CCTTGGAGGGTTAA
mRNA sequence
ATGAGAGCGAAGCTTCCTACATTTGGATGGGGACATGCTATTTTGCATGTTGCATCACTTGCACATATTAAGTCAGTAGCTTATCATAAATACTCATCATTACAATTAGT
TTATGGCCAAGAGCCAAATATTTCTCATCCAAGAATTTTTGGATGTGCAGTATATGTTCCAATTGCTCTACTACAACGTACTAAGATGGGTCCTCAAAGGAGGTTAGGAA
TATACGTTGGATATGATTCCCCATCAATTATTAAATATCTTGAACCCCTGACGGGTGATGTATTTACTGCACGATTTGCTGATTGTCATTTTAATGAGATTAATTTTCCA
ACATTAGGGGGAGGAATTAAGAAGTTGGAAAAGGAAATTGCATGGAATGCATCATTATTGTCTCATTTAGATCCCCGTACAAATCAATTTGAACTTGAAGTTCAGAAAAT
AATTCATTTGCAAAATATAGCAAATCAATTACCAGATGCGTTTATAGATGCAAAGAAAGTAACTAAATCACATATACCCATTACAAATGTTCCGTCGAAAATTGATATCC
CAACACAGCAAGTTGTCACTAATGAATCTGGAACACACCAGAAGCGTGGTAGACCAGTGGGTTCCAAAGATAAAAATCCTCGGAAAAGAAAAATAAATGATAGTAAAAAA
GACCTTGTTGAGGATGTAAATATCCACGAAAAAACCCTAGACATGATTGATAATCCAATTAGTGAAGAAGATAATAATGAAGAACTTCTTGGTCCTAAAGTACCATATCT
TAGTGCAATTGGTGCTATGTATCTTGCTAATAATACAAGACCAGATATTGCATTTTCAGTAAATTTATTAGCTAGATACAATTCTTCTCCAACAAAAAGACATTGGAATG
GAATTAAGCACATACTCTGTTATCTCCGAGGAACAATCGATGTGGGTTATCTGTTCACATGTGGAGGAACTGCTATATCCTGGCGATTAGTGAAACAAACCATAACAGCA
ACTTCCTCAAATCATGCTGAAATTCTTGCAATCCATGAAGCTAGTAGAAAATGTGTATGGCTAAGATCAATGACTCAACACATTGGTGAAACATTTGATTTGTTTTCTAG
TAAAATTTCTCCAACGACATTATACGAAGACAACACAGCAAAACATTGGAATGCGGCGACTCAAAGATCTCAACATCTCACCGGGTCTCACACACGCCACCATTCCCAAC
CACGTGGAGCCGACGTCGTAGTCGTCAGTCATGTAGATCCGGCATTCGACGTTCGCACAAGCCCACTTGTCGAGTGTTCAGAGTTTTGTACCATTCCATTCGTCATCCGT
CGTTCCTACGCCCTCACGATGCCGTTGGGGGACGCTGTTGTTGCCGAATTTTGTTTTAGAAATGAAGGGCGTCAAGATGCCACTGGCAACATCTTGCTTCATAGAGGCTT
GAGATGGTTGCTATCTAATACATATATTCCATGGGGTGGAGATTCTCACTCACTGGGAGACGACTCTGAAGATGCTCAGAATCCTTCTCTCAGAAGAGCTCGAAATGGAG
GCTGGTTATGGCTCGACGCATGGGTTTCCACTTGTTCTCGCAGCAGCTTCGTCTCTTCGTACTTGGTTGTTTGCGCTTCTTTTCTTCTTAACCTTGCGTTAGGAACAAAA
AAGCTCATTGTTTTTCAGGATGAAGGTAAACGCAATAGGACAACGCATAGAGGAAGAACTGGAGAAGGATTTATTCGTTATGAATATGAGTGTTGGACGCATGCAGTCGC
CGCGTTGGCGGGGGGAAGGAACGTTCTTAGCTTACCATGGGTTGGCGTTAAGGTAGGTGGCGTTATGGGTGCCGTTGAAGTAAGAAATGAGAACCCAGTGCGCCAGCCTG
CCTTGGAGGGTTAA
Protein sequence
MRAKLPTFGWGHAILHVASLAHIKSVAYHKYSSLQLVYGQEPNISHPRIFGCAVYVPIALLQRTKMGPQRRLGIYVGYDSPSIIKYLEPLTGDVFTARFADCHFNEINFP
TLGGGIKKLEKEIAWNASLLSHLDPRTNQFELEVQKIIHLQNIANQLPDAFIDAKKVTKSHIPITNVPSKIDIPTQQVVTNESGTHQKRGRPVGSKDKNPRKRKINDSKK
DLVEDVNIHEKTLDMIDNPISEEDNNEELLGPKVPYLSAIGAMYLANNTRPDIAFSVNLLARYNSSPTKRHWNGIKHILCYLRGTIDVGYLFTCGGTAISWRLVKQTITA
TSSNHAEILAIHEASRKCVWLRSMTQHIGETFDLFSSKISPTTLYEDNTAKHWNAATQRSQHLTGSHTRHHSQPRGADVVVVSHVDPAFDVRTSPLVECSEFCTIPFVIR
RSYALTMPLGDAVVAEFCFRNEGRQDATGNILLHRGLRWLLSNTYIPWGGDSHSLGDDSEDAQNPSLRRARNGGWLWLDAWVSTCSRSSFVSSYLVVCASFLLNLALGTK
KLIVFQDEGKRNRTTHRGRTGEGFIRYEYECWTHAVAALAGGRNVLSLPWVGVKVGGVMGAVEVRNENPVRQPALEG
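
The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which this record does not state explicitly; `translate` is a hypothetical helper, and only the first 39 and last 9 nucleotides of the CDS are used as input.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 39 nt of the CDS listed in this record.
print(translate("ATGAGAGCGAAGCTTCCTACATTTGGATGGGGACATGCT"))
# Last 9 nt of the CDS, ending in the stop codon TAA.
print(translate("GAGGGTTAA"))
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence is a quick consistency check on the record.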