; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh15G010460 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh15G010460
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionReverse transcriptase Ty1/copia-type domain-containing protein
Genome locationCmo_Chr15:6671188..6673814
RNA-Seq ExpressionCmoCh15G010460
SyntenyCmoCh15G010460
Gene Ontology termsGO:0006810 - transport (biological process)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAX96193.1 retrotransposon protein, putative, Ty1-copia sub-class [Oryza sativa Japonica Group]1.1e-6830.74Show/hide
Query:  GYLDGTNAAPAKLVPSSTAAGAELVSNPAYNQWYGQDQQVLSGLLSSMSEEILHDVVAATTSKEAWDTLQRMFSSSTRAHTTQIRVELATSKKRDQSAAN
        G+L G    P   +  +     +   NPA+  W   DQQVL  LLSS++ E+L  V    T+ E W  +++M+S+ TRA     R  L  +KK + S   
Subjt:  GYLDGTNAAPAKLVPSSTAAGAELVSNPAYNQWYGQDQQVLSGLLSSMSEEILHDVVAATTSKEAWDTLQRMFSSSTRAHTTQIRVELATSKKRDQSAAN

Query:  YFRKIKGLATELAAAGS-ALQDDDVIAYLLAGLGPDYDPFVTSMTTKSEALTLDDVFAHLM---------------------------------------
        YF K+K L  E+A AG   + ++++I Y++ GLG  Y   V+++  + E +++ D+++ ++                                       
Subjt:  YFRKIKGLATELAAAGS-ALQDDDVIAYLLAGLGPDYDPFVTSMTTKSEALTLDDVFAHLM---------------------------------------

Query:  TRGGQQKNHGRRDRGRGRSQGYAPSCPVGDRRGPSARLSCQICDKVGHTAIRCWHRMDESYQDEPSSAPPTTLAATSSYKIDPNWYSDTGATDHITSDLD
        +RGG    HGR + G GR +G  P        G   R  CQ+C K GHTA  CW+R DE Y  +   A     AA +SY +D NWY DTGATDH+T +LD
Subjt:  TRGGQQKNHGRRDRGRGRSQGYAPSCPVGDRRGPSARLSCQICDKVGHTAIRCWHRMDESYQDEPSSAPPTTLAATSSYKIDPNWYSDTGATDHITSDLD

Query:  RLAVRERYHGGEQVQVGNGAGYSSLHEGYKCLDTNSGCVYMSRDVIFDENVFPFKRAPPNSSPIMQSTHNAPNLCTLHLSNSSTNLENDHMLMSGPTNSL
        +L ++E+Y+GGEQ+   +GAG+                +Y+SRDV+FDEN+FPF     N+S  ++S     ++ T  L     ++ N+   M   TN L
Subjt:  RLAVRERYHGGEQVQVGNGAGYSSLHEGYKCLDTNSGCVYMSRDVIFDENVFPFKRAPPNSSPIMQSTHNAPNLCTLHLSNSSTNLENDHMLMSGPTNSL

Query:  DAEI--LVSTSASELPQQSSALLP--CESALVVPPMIEASAPPPADDIVQCPVESSAAGQPTTVATVVP-------------------------------
         A++   +S  A+E P +  A+ P   E     PP+  +  P P   +   P  +S+   P      VP                               
Subjt:  DAEI--LVSTSASELPQQSSALLP--CESALVVPPMIEASAPPPADDIVQCPVESSAAGQPTTVATVVP-------------------------------

Query:  ----------------------SNVDPASTTHPYG--TRLKHNIKKLKVCTDETVTYLVARSSAS-EPTSHITAMEHPLWRQAMNDEFQALQKNKTWHLV
                              ++V    +  P G  TRL+  ++K KV TD T+ Y  +  +AS EPT+ + A++   W+ AM+ E+ AL KNKTWHLV
Subjt:  ----------------------SNVDPASTTHPYG--TRLKHNIKKLKVCTDETVTYLVARSSAS-EPTSHITAMEHPLWRQAMNDEFQALQKNKTWHLV

Query:  PPRAGLNVIDCKWVFKLK
        PP+ G N+I CKWV+K+K
Subjt:  PPRAGLNVIDCKWVFKLK

KAG8084596.1 hypothetical protein GUJ93_ZPchr0010g7974 [Zizania palustris]1.1e-8758.97Show/hide
Query:  MGYLDGTNAAPAKLVPSSTAAGAELVSNPAYNQWYGQDQQVLSGLLSSMSEEILHDVVAATTSKEAWDTLQRMFSSSTRAHTTQIRVELATSKKRDQSAA
        +G+LDG++ A +K + +ST AGA  ++NPAY +WY  DQQ+LSGLLSSM+EE+L DV  ATT+KEAWD LQR F+SSTRA   QIRVELAT KKRD SA 
Subjt:  MGYLDGTNAAPAKLVPSSTAAGAELVSNPAYNQWYGQDQQVLSGLLSSMSEEILHDVVAATTSKEAWDTLQRMFSSSTRAHTTQIRVELATSKKRDQSAA

Query:  NYFRKIKGLATELAAAGSALQDDDVIAYLLAGLGPDYDPFVTSMTTKSEALTLDDVFAHLMTRGGQQKNH-----------------GR---RDRGRGRS
        +YF K++GLA +LA AG+ L+DD+++AYL AGL  +YDPFVTSMTT     T+DDVFAHL+    +Q  H                 GR   R RGRG  
Subjt:  NYFRKIKGLATELAAAGSALQDDDVIAYLLAGLGPDYDPFVTSMTTKSEALTLDDVFAHLMTRGGQQKNH-----------------GR---RDRGRGRS

Query:  QGY-APS----CPV--GDRRGPSARLSCQICDKVGHTAIRCWHRMDESYQDEPSSAPPTTLAATSSYKIDPNWYSDTGATDHITSDLDRLAVRERYHGGE
        +G+ APS     P   G  RG + R +CQIC+K GHTA+RCW+RMDESYQ+E   AP   +A+TSSYKID NWY DTGATDHITSDLDRLA+RERY+GG+
Subjt:  QGY-APS----CPV--GDRRGPSARLSCQICDKVGHTAIRCWHRMDESYQDEPSSAPPTTLAATSSYKIDPNWYSDTGATDHITSDLDRLAVRERYHGGE

Query:  QVQVGNGAGYSS
        QVQVGNGAG S+
Subjt:  QVQVGNGAGYSS

XP_012700051.1 uncharacterized protein LOC105914032 [Setaria italica]2.6e-7333.79Show/hide
Query:  GYLDGTNAAPAKLVPSSTAAGAELVSNPAYNQWYGQDQQVLSGLLSSMSEEILHDVVAATTSKEAWDTLQRMFSSSTRAHTTQIRVELATSKKRDQSAAN
        G+LD T   P K +        E   NP ++ W  +DQQVLS LLS++S++IL     A T+ EAW ++  +F+S TRA    +R+ LAT+KK  QS   
Subjt:  GYLDGTNAAPAKLVPSSTAAGAELVSNPAYNQWYGQDQQVLSGLLSSMSEEILHDVVAATTSKEAWDTLQRMFSSSTRAHTTQIRVELATSKKRDQSAAN

Query:  YFRKIKGLATELAAAGSALQDDDVIAYLLAGLGPDYDPFVTSMTTKSEALTLDDVFAHLMTRGGQQKNHGRRDRGRGRSQGYAPSCPVGDRRGPSARLSC
        YF K+KG   E+  AG AL DD+++ Y+LAGLGP+++  VTS+TT+ E++++D++++ L+T   +                              + L+ 
Subjt:  YFRKIKGLATELAAAGSALQDDDVIAYLLAGLGPDYDPFVTSMTTKSEALTLDDVFAHLMTRGGQQKNHGRRDRGRGRSQGYAPSCPVGDRRGPSARLSC

Query:  QICDKVGHTAIRCWHRMDESYQDEPSSAPPTTLAATSSYKIDPNWYSDTGATDHITSDLDRLAVRERYHGGEQVQVGNGAGYSSLHEGYKCLDTNSGCVY
         +C+K GH+A++CW+R DE Y  E   A     AA++SY +D NWY+D+G+TDHITS+L +LA RE+Y+GG+Q+   NG                SG VY
Subjt:  QICDKVGHTAIRCWHRMDESYQDEPSSAPPTTLAATSSYKIDPNWYSDTGATDHITSDLDRLAVRERYHGGEQVQVGNGAGYSSLHEGYKCLDTNSGCVY

Query:  MSRDVIFDENVFPFKRAPPNSSPIMQSTHNAPNLCTLHLSNSSTNLENDHM-----LMSGPTNSLDAEILV-----------STSASELPQQSSALLPC-
        +SRDV+FDE+VFPF+   PN+   ++     P +  L  S S     +D M     L +    S+ +++L               A++ P  ++   PC 
Subjt:  MSRDVIFDENVFPFKRAPPNSSPIMQSTHNAPNLCTLHLSNSSTNLENDHM-----LMSGPTNSLDAEILV-----------STSASELPQQSSALLPC-

Query:  ------ESALVVPPMIE--------------ASAPP--------PADDIVQCPVESSA--AGQPTTVA------------------------TVVPSNVD
                 L   P  E              A  PP        P  D        SA  A + T  A                        T V ++  
Subjt:  ------ESALVVPPMIE--------------ASAPP--------PADDIVQCPVESSA--AGQPTTVA------------------------TVVPSNVD

Query:  PASTTHPYGTRLKHNIKKLKVCTDETVTYLVARSSASEPTSHITAMEHPLWRQAMNDEFQALQKNKTWHLVPPRAGLNVIDCKWVFKLK
        PA+ T    TRL+  I+K K  TD TV Y +  +S  EP+S   A+    W QAM  E++AL KN+TWHLVPP+ G NVI CKWV+K+K
Subjt:  PASTTHPYGTRLKHNIKKLKVCTDETVTYLVARSSASEPTSHITAMEHPLWRQAMNDEFQALQKNKTWHLVPPRAGLNVIDCKWVFKLK

XP_012701436.1 uncharacterized protein LOC105914402 [Setaria italica]3.6e-7534.56Show/hide
Query:  GYLDGTNAAPAKLVPSSTAAGAEL-VSNPAYNQWYGQDQQVLSGLLSSMSEEILHDVVAATTSKEAWDTLQRMFSSSTRAHTTQIRVELATSKKRDQSAA
        GYL+G   APA+ +    A G  + V NP Y  W+ +DQ+VL  +LSS+ +E+   VVAA T+ EAW  ++ MFS+ TRA T  +R+ LAT++K +Q+ A
Subjt:  GYLDGTNAAPAKLVPSSTAAGAEL-VSNPAYNQWYGQDQQVLSGLLSSMSEEILHDVVAATTSKEAWDTLQRMFSSSTRAHTTQIRVELATSKKRDQSAA

Query:  NYFRKIKGLATELAAAGSALQDDDVIAYLLAGLGPDYDPFVTSMTTKSEALTLDDVFAHLMT-------------------RGGQ---------------
         YF K+KG A E+AAAG  + D+D+ A++  GL  +++P VTS+T ++E L++ ++++ L++                   RGG+               
Subjt:  NYFRKIKGLATELAAAGSALQDDDVIAYLLAGLGPDYDPFVTSMTTKSEALTLDDVFAHLMT-------------------RGGQ---------------

Query:  ---QKNHGRRDRGRGRSQGYAPSCPVGDRRGPSA----------RLSCQICDKVGHTAIRCWHRMDESYQDEPSSAPPTTLAATSSYKIDPNWYSDTGAT
            +  GR D GRG S+G +P+   G +RG  A          R  CQ+C K GHTA++CWHR DESY  E          A +SY +D NWY+DTGAT
Subjt:  ---QKNHGRRDRGRGRSQGYAPSCPVGDRRGPSA----------RLSCQICDKVGHTAIRCWHRMDESYQDEPSSAPPTTLAATSSYKIDPNWYSDTGAT

Query:  DHITSDLDRLAVRERYHGGEQVQVGNGAGYSSLHEGYKCLDTNSGCVYMSRDVIFDENVFPFKRAPPNSSPIMQSTH-NAPNLCTLHLSNSSTNLENDHM
        DH+TS+L++L+ RERYHG +Q+   +GAG + L      L ++   + +       + VF       NS+    +TH N  + C   +S+      +++ 
Subjt:  DHITSDLDRLAVRERYHGGEQVQVGNGAGYSSLHEGYKCLDTNSGCVYMSRDVIFDENVFPFKRAPPNSSPIMQSTH-NAPNLCTLHLSNSSTNLENDHM

Query:  LMSGPTNSLDAEILVSTSASELPQQSSALLPCESALVVP-PMIEASAPP------------------PADDIVQCPVESSAAGQPTTVATV-VPSN----
            P  S     L S S     Q++SA      A  +P P     APP                  PA D    P   S A       T  VP      
Subjt:  LMSGPTNSLDAEILVSTSASELPQQSSALLPCESALVVP-PMIEASAPP------------------PADDIVQCPVESSAAGQPTTVATV-VPSN----

Query:  --VDPASTTHP------YGTRLKHNIKKLKVCTDETVTYLVARSSASEPTSHITAMEHPLWRQAMNDEFQALQKNKTWHLVPPRAGLNVIDCKWVFKLK
          VD +S   P      + TRL   I+K K+ TD TV Y    +++ EP +   A+ +  W+ AM+ E++AL +N+TWHLVPPR G+N+IDCKWV+K+K
Subjt:  --VDPASTTHP------YGTRLKHNIKKLKVCTDETVTYLVARSSASEPTSHITAMEHPLWRQAMNDEFQALQKNKTWHLVPPRAGLNVIDCKWVFKLK

XP_022931693.1 uncharacterized protein LOC111437852 [Cucurbita moschata]1.8e-7174.16Show/hide
Query:  MDESYQDEPSSAPPTTLAATSSYKIDPNWYSDTGATDHITSDLDRLAVRERYHGGEQVQVGNGAGYSSLHEGYKCLDTNSGCVYMSRDVIFDENVFPFKR
        MDES QDEP S P   LAATSS KIDPNWYSDT ATDHITSDLDRLAVRERYHGG+QVQVGNGA       GYKCLDT+SG V +SRDVIFDENVFPFKR
Subjt:  MDESYQDEPSSAPPTTLAATSSYKIDPNWYSDTGATDHITSDLDRLAVRERYHGGEQVQVGNGAGYSSLHEGYKCLDTNSGCVYMSRDVIFDENVFPFKR

Query:  APPNSSPIMQSTHNAPNLCTLHLSNSSTNLENDHMLMSGPTNSLDAEILVSTSASELPQQSSALLPCESALVVPPMIEASAPPPADDIVQCPVESSAAGQ
        AP NSSP +  THNA +LCTLHL +SSTNL+NDH  +S PTNSLDA+ LV TS S+LPQQSSA L C+SAL V PMI AS P PADDI QCPV S AAGQ
Subjt:  APPNSSPIMQSTHNAPNLCTLHLSNSSTNLENDHMLMSGPTNSLDAEILVSTSASELPQQSSALLPCESALVVPPMIEASAPPPADDIVQCPVESSAAGQ

Query:  PTTVATVVP
        PT VA   P
Subjt:  PTTVATVVP

TrEMBL top hitse value%identityAlignment
A0A2N9FGM0 Reverse transcriptase Ty1/copia-type domain-containing protein3.3e-7435.64Show/hide
Query:  PAKLVPSSTAAGAELVSNPAYNQWYGQDQQVLSGLLSSMSEEILHDVVAATTSKEAWDTLQRMFSSSTRAHTTQIRVELATSKKRDQSAANYFRKIKGLA
        P  ++ +S+   +  + NP + QW  QDQ VLS L+SS+SE+++  VV  TTS++ W TL+RMF++ ++A   QI  +L+T +K   S +++F+   GLA
Subjt:  PAKLVPSSTAAGAELVSNPAYNQWYGQDQQVLSGLLSSMSEEILHDVVAATTSKEAWDTLQRMFSSSTRAHTTQIRVELATSKKRDQSAANYFRKIKGLA

Query:  TELAAAGSALQDDDVIAYLLAGLGPDYDPFVTSMTTKSEALTLDDVFAHLMT-------------------------------RGGQQKNHGRRDRGRGR
          LAA    L +  ++++LLAGLGP+YD FVTS+  ++E +TLD ++ HL+T                               RGG+ ++     RG+  
Subjt:  TELAAAGSALQDDDVIAYLLAGLGPDYDPFVTSMTTKSEALTLDDVFAHLMT-------------------------------RGGQQKNHGRRDRGRGR

Query:  SQGYAPSCPVG-DRRGP-SARLSCQICDKVGHTAIRCWHRMDESYQDEPSSAPPTTLAATSSYKIDPNWYSDTGATDHITSDLDRLAVR-ERYHGGEQVQ
        S  +  +   G  R  P +AR  CQ+C++  H A+ C+HR D SY  E  SA      +T     DPNWY+DTGAT+H+TSDL  L V  E Y G +Q++
Subjt:  SQGYAPSCPVG-DRRGP-SARLSCQICDKVGHTAIRCWHRMDESYQDEPSSAPPTTLAATSSYKIDPNWYSDTGATDHITSDLDRLAVR-ERYHGGEQVQ

Query:  VGNGAGYSSLHEGYKCLDTNSGCVYMSRDVIFDENVFPFKRAPPNSSPIMQSTHNAPNLCTLHLSNSSTNLENDHMLMSGPTNSLDAEILVSTSASELPQ
        VGNG G S  H         +G +Y+SRDVIF+E  FPF   PP  + + Q+   +P L                 L+  PT               LP 
Subjt:  VGNGAGYSSLHEGYKCLDTNSGCVYMSRDVIFDENVFPFKRAPPNSSPIMQSTHNAPNLCTLHLSNSSTNLENDHMLMSGPTNSLDAEILVSTSASELPQ

Query:  QSSALLPCESALVVPPMIEASAPPPADDIVQCPVESSAAGQPTTVATVVPSNVDPASTTHPYGTRLKHNIKKLKVCTDETVTY-----LVARSSAS--EP
        Q+    P +  L   P  + S   PA       + S A  QPT    +   +  P + +HP  TR K NI K K   D TV Y     L+A ++ S  EP
Subjt:  QSSALLPCESALVVPPMIEASAPPPADDIVQCPVESSAAGQPTTVATVVPSNVDPASTTHPYGTRLKHNIKKLKVCTDETVTY-----LVARSSAS--EP

Query:  TSHITAMEHPLWRQAMNDEFQALQKNKTWHLVPPRAGLNVIDCKWVFKLK
        T + +A++ P WR AMN EF AL KN+TW LVP     N++ CKWVF++K
Subjt:  TSHITAMEHPLWRQAMNDEFQALQKNKTWHLVPPRAGLNVIDCKWVFKLK

A0A2N9FW27 Uncharacterized protein3.8e-7535.48Show/hide
Query:  YLDGTNAAPAKLVPSSTAAGAELVSNPAYNQWYGQDQQVLSGLLSSMSEEILHDVVAATTSKEAWDTLQRMFSSSTRAHTTQIRVELATSKKRDQSAANY
        ++DG+   P  ++ +S+   +  + NP + QW  QDQ VLS L+SS+SE+++  VV  TTS++ W TL+RMF++ ++A   QI  +L+T +K   S +++
Subjt:  YLDGTNAAPAKLVPSSTAAGAELVSNPAYNQWYGQDQQVLSGLLSSMSEEILHDVVAATTSKEAWDTLQRMFSSSTRAHTTQIRVELATSKKRDQSAANY

Query:  FRKIKGLATELAAAGSALQDDDVIAYLLAGLGPDYDPFVTSMTTKSEALTLDDVFAHLMT-------------------------------RGGQQKNHG
        F+   GLA  LAA    L +  ++++LLAGLGP+YD FVTS+  ++E +TLD ++ HL+T                               RGG+ ++  
Subjt:  FRKIKGLATELAAAGSALQDDDVIAYLLAGLGPDYDPFVTSMTTKSEALTLDDVFAHLMT-------------------------------RGGQQKNHG

Query:  RRDRGRGRSQGYAPSCPVG-DRRGP-SARLSCQICDKVGHTAIRCWHRMDESYQDEPSSAPPTTLAATSSYKIDPNWYSDTGATDHITSDLDRLAVR-ER
           RG+  S  +  +   G  R  P +AR  CQ+C++  H A+ C+HR D SY  E  SA      +T     DPNWY+DTGAT+H+TSDL  L V  E 
Subjt:  RRDRGRGRSQGYAPSCPVG-DRRGP-SARLSCQICDKVGHTAIRCWHRMDESYQDEPSSAPPTTLAATSSYKIDPNWYSDTGATDHITSDLDRLAVR-ER

Query:  YHGGEQVQVGNGAGYSSLHEGYKCLDTNSGCVYMSRDVIFDENVFPFKRAPPNSSPIMQSTHNAPNLCTLHLSNSSTNLENDHMLMSGPTNSLDAEILVS
        Y G +Q++VGNG G S  H         +G +Y+SRDVIF+E  FPF   PP  +   Q+   +P L                 L+  PT          
Subjt:  YHGGEQVQVGNGAGYSSLHEGYKCLDTNSGCVYMSRDVIFDENVFPFKRAPPNSSPIMQSTHNAPNLCTLHLSNSSTNLENDHMLMSGPTNSLDAEILVS

Query:  TSASELPQQSSALLPCESALVVPPMIEASAPPPADDIVQCPVESSAAGQPTTVATVVPSNVDPASTTHPYGTRLKHNIKKLKVCTDETVTY-----LVAR
             LP Q+    P +  L   P  + S   PA       + S A  QPT    +   +  P + +HP  TR K NI K K   D TV Y     L+A 
Subjt:  TSASELPQQSSALLPCESALVVPPMIEASAPPPADDIVQCPVESSAAGQPTTVATVVPSNVDPASTTHPYGTRLKHNIKKLKVCTDETVTY-----LVAR

Query:  SSAS--EPTSHITAMEHPLWRQAMNDEFQALQKNKTWHLVPPRAGLNVIDCKWVFKLK
        ++ S  EPT + +A++ P WR AMN EF AL KN+TW LVP     N++ CKWVF++K
Subjt:  SSAS--EPTSHITAMEHPLWRQAMNDEFQALQKNKTWHLVPPRAGLNVIDCKWVFKLK

A0A2N9GWM4 Uncharacterized protein4.3e-7434.57Show/hide
Query:  GYLDGTNAAPAKLVPSSTAAGAELVSNPAYNQWYGQDQQVLSGLLSSMSEEILHDVVAATTSKEAWDTLQRMFSSSTRAHTTQIRVELATSKKRDQSAAN
        G++DGT  AP   +  S +     + NP +  W+ QDQ +LS L+SS+SE IL  VV  TTS++ W  L+RMF+S +RA + Q+  +L+T KK D S A+
Subjt:  GYLDGTNAAPAKLVPSSTAAGAELVSNPAYNQWYGQDQQVLSGLLSSMSEEILHDVVAATTSKEAWDTLQRMFSSSTRAHTTQIRVELATSKKRDQSAAN

Query:  YFRKIKGLATELAAAGSALQDDDVIAYLLAGLGPDYDPFVTSMTTKSEALTLDDVFAHLMT-------------------------------RGGQ----
        ++ K   LA  LAA    L+D D++++ LAGLG DYD  VT++  +   +TLD+++   ++                               RGG+    
Subjt:  YFRKIKGLATELAAAGSALQDDDVIAYLLAGLGPDYDPFVTSMTTKSEALTLDDVFAHLMT-------------------------------RGGQ----

Query:  -------QKNHGRRDRGRGRSQGYAPSCPVGDRRGPSARLSCQICDKVGHTAIRCWHRMDESYQDEPSSAPPT--TLAATSSYKIDPNWYSDTGATDHIT
                 N  R+ RGRGR +     CP       S+R +CQ+C K GHTA+ C+HR D SY  E    PP    L AT +Y  DPNWYSD+GAT HIT
Subjt:  -------QKNHGRRDRGRGRSQGYAPSCPVGDRRGPSARLSCQICDKVGHTAIRCWHRMDESYQDEPSSAPPT--TLAATSSYKIDPNWYSDTGATDHIT

Query:  SDLDRLAVR-ERYHGGEQVQVGNG-------------------------------AGYSSLHE-----GYKCLDTNSGCVYMSRDVIFDENVFPFKRAPP
        SDL  L VR + YHG +Q++  +                                   +S+H+     GYKC    +G  Y+SRDVIF E  FPF++ P 
Subjt:  SDLDRLAVR-ERYHGGEQVQVGNG-------------------------------AGYSSLHE-----GYKCLDTNSGCVYMSRDVIFDENVFPFKRAPP

Query:  NSSPIMQSTHN--APNLCTLHLSNSSTNLENDHMLMSGPTNSLDAEILVSTSASELPQQSS---ALLPCESALVVPPMIEASAPPPADDIVQCPVESSAA
          +P+ QST +   P +  L L +             GP N+  + +   T +   P+ SS   A  P  ++L V P    S+ P  D  V  P   +A 
Subjt:  NSSPIMQSTHN--APNLCTLHLSNSSTNLENDHMLMSGPTNSLDAEILVSTSASELPQQSS---ALLPCESALVVPPMIEASAPPPADDIVQCPVESSAA

Query:  GQPTTVATVVPSNVDPASTT---------HPYGTRLKHNIKKLKVCTDETVTYLVAR---------SSASEPTSHITAMEHPLWRQAMNDEFQALQKNKT
         + +T  TV  +N + +STT         HP  TR ++ I K K  TD T+ Y + +         +  +EPT H  A + P WR+AMN EF AL KN T
Subjt:  GQPTTVATVVPSNVDPASTT---------HPYGTRLKHNIKKLKVCTDETVTYLVAR---------SSASEPTSHITAMEHPLWRQAMNDEFQALQKNKT

Query:  WHLVPPRAGLNVIDCKWVFKLK
        W LVP  +  N+I CKWVF++K
Subjt:  WHLVPPRAGLNVIDCKWVFKLK

A0A2N9HRE1 Reverse transcriptase Ty1/copia-type domain-containing protein1.8e-7236.64Show/hide
Query:  DQQVLSGLLSSMSEEILHDVVAATTSKEAWDTLQRMFSSSTRAHTTQIRVELATSKKRDQSAANYFRKIKGLATELAAAGSALQDDDVIAYLLAGLGPDY
        DQ +LS L+SS+SE+++  VV  TTS++ W TL+RMF++ ++A   QI  +L+T +K   S A++F    GLA  LAA    L +  ++++LLAGLGP+Y
Subjt:  DQQVLSGLLSSMSEEILHDVVAATTSKEAWDTLQRMFSSSTRAHTTQIRVELATSKKRDQSAANYFRKIKGLATELAAAGSALQDDDVIAYLLAGLGPDY

Query:  DPFVTSMTTKSEALTLDDVFAHLMTRGGQQK-------------NHGRRD----RGRGRSQGYAPSCPVGDRRGPS----------------ARLSCQIC
        D FVTS+  ++E +TLD ++ HL+    + K             N+  R      GRG     +PS   G+   PS                AR  CQ+C
Subjt:  DPFVTSMTTKSEALTLDDVFAHLMTRGGQQK-------------NHGRRD----RGRGRSQGYAPSCPVGDRRGPS----------------ARLSCQIC

Query:  DKVGHTAIRCWHRMDESYQDEPSSAPPTTLAATSSYKIDPNWYSDTGATDHITSDLDRLAVR-ERYHGGEQVQVGNGAGYSSLHEGYKCLDTNSGCVYMS
        +++GH A+ C+HR D +Y  E S+A      +T     DPNWY+DTGAT+H+TSDL+ L +  E Y G +Q++VGNG G S  H          G +Y+S
Subjt:  DKVGHTAIRCWHRMDESYQDEPSSAPPTTLAATSSYKIDPNWYSDTGATDHITSDLDRLAVR-ERYHGGEQVQVGNGAGYSSLHEGYKCLDTNSGCVYMS

Query:  RDVIFDENVFPFKRAPPNSSPIMQSTHNAPNLCTLHLSNSSTNLENDHMLMSGPTNSLDAEILVSTSASELPQQSSALLPCESAL-VVPPMIEASAPPPA
         DVIF+EN+FPF+  PP     +Q     P L                 L+  PT S  A+   + S   +P  SS + P  S +    P    S+  PA
Subjt:  RDVIFDENVFPFKRAPPNSSPIMQSTHNAPNLCTLHLSNSSTNLENDHMLMSGPTNSLDAEILVSTSASELPQQSSALLPCESAL-VVPPMIEASAPPPA

Query:  DDIVQCPVESSAAGQPTTVATVVPSNVDPASTTHPYGTRLKHNIKKLKVCTDETVTY-----LVARS--SASEPTSHITAMEHPLWRQAMNDEFQALQKN
               +  +A   PT+  ++ P +  P +++HP  TR K NI K K   D TV Y     L+A +  S SEPT + +A++ P WR+AMN EF AL KN
Subjt:  DDIVQCPVESSAAGQPTTVATVVPSNVDPASTTHPYGTRLKHNIKKLKVCTDETVTY-----LVARS--SASEPTSHITAMEHPLWRQAMNDEFQALQKN

Query:  KTWHLVPPRAGLNVIDCKWVFKLK
        +TW LVP     N++ CKWVF++K
Subjt:  KTWHLVPPRAGLNVIDCKWVFKLK

A0A6J1EZF8 uncharacterized protein LOC1114378528.9e-7274.16Show/hide
Query:  MDESYQDEPSSAPPTTLAATSSYKIDPNWYSDTGATDHITSDLDRLAVRERYHGGEQVQVGNGAGYSSLHEGYKCLDTNSGCVYMSRDVIFDENVFPFKR
        MDES QDEP S P   LAATSS KIDPNWYSDT ATDHITSDLDRLAVRERYHGG+QVQVGNGA       GYKCLDT+SG V +SRDVIFDENVFPFKR
Subjt:  MDESYQDEPSSAPPTTLAATSSYKIDPNWYSDTGATDHITSDLDRLAVRERYHGGEQVQVGNGAGYSSLHEGYKCLDTNSGCVYMSRDVIFDENVFPFKR

Query:  APPNSSPIMQSTHNAPNLCTLHLSNSSTNLENDHMLMSGPTNSLDAEILVSTSASELPQQSSALLPCESALVVPPMIEASAPPPADDIVQCPVESSAAGQ
        AP NSSP +  THNA +LCTLHL +SSTNL+NDH  +S PTNSLDA+ LV TS S+LPQQSSA L C+SAL V PMI AS P PADDI QCPV S AAGQ
Subjt:  APPNSSPIMQSTHNAPNLCTLHLSNSSTNLENDHMLMSGPTNSLDAEILVSTSASELPQQSSALLPCESALVVPPMIEASAPPPADDIVQCPVESSAAGQ

Query:  PTTVATVVP
        PT VA   P
Subjt:  PTTVATVVP

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.4e-0443.08Show/hide
Query:  TYLVARSSASEPTSHITAMEHPLWRQ---AMNDEFQALQKNKTWHLVPPRAGLNVIDCKWVFKLK
        T  V  S   EP S    + HP   Q   AM +E ++LQKN T+ LV    G   + CKWVFKLK
Subjt:  TYLVARSSASEPTSHITAMEHPLWRQ---AMNDEFQALQKNKTWHLVPPRAGLNVIDCKWVFKLK

P92520 Uncharacterized mitochondrial protein AtMg008202.9e-1146.25Show/hide
Query:  TRLKHNIKKLKVCTDETVTYLVARSSASEPTSHITAMEHPLWRQAMNDEFQALQKNKTWHLVPPRAGLNVIDCKWVFKLK
        TR K  I KL      T+T  + +    EP S I A++ P W QAM +E  AL +NKTW LVPP    N++ CKWVFK K
Subjt:  TRLKHNIKKLKVCTDETVTYLVARSSASEPTSHITAMEHPLWRQAMNDEFQALQKNKTWHLVPPRAGLNVIDCKWVFKLK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.9e-2426.25Show/hide
Query:  GYLDGTNAAPAKLVPSSTAAGAELVSNPAYNQWYGQDQQVLSGLLSSMSEEILHDVVAATTSKEAWDTLQRMFSSSTRAHTTQIRVELATSKKRDQSAAN
        G+LDG+   P    P++    A    NP Y +W  QD+ + S +L ++S  +   V  ATT+ + W+TL++++++ +  H TQ+R +L    K  ++  +
Subjt:  GYLDGTNAAPAKLVPSSTAAGAELVSNPAYNQWYGQDQQVLSGLLSSMSEEILHDVVAATTSKEAWDTLQRMFSSSTRAHTTQIRVELATSKKRDQSAAN

Query:  YFRKIKGLATELAAAGSALQDDDVIAYLLAGLGPDYDPFVTSMTTKSEALTLDDVFAHLM-----------------------------TRGGQQKNHGR
        Y + +     +LA  G  +  D+ +  +L  L  +Y P +  +  K    TL ++   L+                             T      N   
Subjt:  YFRKIKGLATELAAAGSALQDDDVIAYLLAGLGPDYDPFVTSMTTKSEALTLDDVFAHLM-----------------------------TRGGQQKNHGR

Query:  RDRGRGRSQGYAPSCPVGDRRGPSARLS------CQICDKVGHTAIRC---WHRMDESYQDEPSS-----APPTTLAATSSYKIDPNWYSDTGATDHITS
        R   R  +    P         P+   S      CQIC   GH+A RC    H +      +P S      P   LA  S Y  + NW  D+GAT HITS
Subjt:  RDRGRGRSQGYAPSCPVGDRRGPSARLS------CQICDKVGHTAIRC---WHRMDESYQDEPSS-----APPTTLAATSSYKIDPNWYSDTGATDHITS

Query:  DLDRLAVRERYHGGEQVQVGNGAGYSSLHEGYKCLDTNS
        D + L++ + Y GG+ V V +G+     H G   L T S
Subjt:  DLDRLAVRERYHGGEQVQVGNGAGYSSLHEGYKCLDTNS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.7e-2427.27Show/hide
Query:  GYLDGTNAAPAKLVPSSTAAGAELVSNPAYNQWYGQDQQVLSGLLSSMSEEILHDVVAATTSKEAWDTLQRMFSSSTRAHTTQIRVELATSKKRDQSAAN
        G+LDG+   P    P++    A    NP Y +W  QD+ + S +L ++S  +   V  ATT+ + W+TL++++++ +  H TQ+R       + DQ    
Subjt:  GYLDGTNAAPAKLVPSSTAAGAELVSNPAYNQWYGQDQQVLSGLLSSMSEEILHDVVAATTSKEAWDTLQRMFSSSTRAHTTQIRVELATSKKRDQSAAN

Query:  YFRKIKGLATELAAAGSALQDDDVIAYLLAGLGPDYDPFVTSMTTK-----------------SEALTLDD-----VFAHLMTRGGQQKNHGRRDRG---
                   LA  G  +  D+ +  +L  L  DY P +  +  K                 S+ L L+      + A+++T      N  + +RG   
Subjt:  YFRKIKGLATELAAAGSALQDDDVIAYLLAGLGPDYDPFVTSMTTK-----------------SEALTLDD-----VFAHLMTRGGQQKNHGRRDRG---

Query:  -----RGRSQGYAPSC--PVGDRRGPSARLS-CQICDKVGHTAIRC--WHRMDESYQDEPSSAPPT------TLAATSSYKIDPNWYSDTGATDHITSDL
               RS  + PS      D R P   L  CQIC   GH+A RC   H+   +   + S++P T       LA  S Y  + NW  D+GAT HITSD 
Subjt:  -----RGRSQGYAPSC--PVGDRRGPSARLS-CQICDKVGHTAIRC--WHRMDESYQDEPSSAPPT------TLAATSSYKIDPNWYSDTGATDHITSDL

Query:  DRLAVRERYHGGEQVQVGNGAGYSSLHEGYKCLDTNSGCVYMSRDVIFDENV
        + L+  + Y GG+ V + +G+     H G   L T+S  + +++ V++  N+
Subjt:  DRLAVRERYHGGEQVQVGNGAGYSSLHEGYKCLDTNSGCVYMSRDVIFDENV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.9e-1125.17Show/hide
Query:  GYSSLHEGYKCLDTNSGCVYMSRDVIFDENVFPFKR---------------------------------APPNSSPIMQSTHNAPN----LCTLHLSNSS
        GYS     Y CL   +G +Y SR V FDE  FPF                                   APP   P + ++   P+    LCT  +S+S+
Subjt:  GYSSLHEGYKCLDTNSGCVYMSRDVIFDENVFPFKR---------------------------------APPNSSPIMQSTHNAPN----LCTLHLSNSS

Query:  TNLENDHMLMSGPTNSLDAEILVSTSASELPQQSSALLPC----------------ESALVVPPMIEASAPPPADDIVQCPVESSAAGQPTTVATVVPS-
            +     S    +        T+     Q S++  P                  S L   P+     P P+  I +    SS++     +  V+P+ 
Subjt:  TNLENDHMLMSGPTNSLDAEILVSTSASELPQQSSALLPC----------------ESALVVPPMIEASAPPPADDIVQCPVESSAAGQPTTVATVVPS-

Query:  -----NVDPASTTHPYGTRLKHNIKKLKVCTDETVTYLVARSSASEPTSHITAMEHPLWRQAMNDEFQALQKNKTWHLV-PPRAGLNVIDCKWVFKLK
             N      TH   TR K  I+K     ++  +Y  + ++ SEP + I AM+   WRQAM  E  A   N TW LV PP   + ++ C+W+F  K
Subjt:  -----NVDPASTTHPYGTRLKHNIKKLKVCTDETVTYLVARSSASEPTSHITAMEHPLWRQAMNDEFQALQKNKTWHLV-PPRAGLNVIDCKWVFKLK

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.2e-0625Show/hide
Query:  MGYLDGTNAAPAKLVPSSTAAGAELVSNPAYNQWYGQDQQVLSGLLSSMS-EEILHDVVAATTSKEAWDTLQRMFSSSTRAHTTQIRVELATSKKRDQSA
        MG++DGT      L+P          +N     W  +D  V   L  +++ ++     V ++TS++ W  ++  F ++  A   ++  EL T    D   
Subjt:  MGYLDGTNAAPAKLVPSSTAAGAELVSNPAYNQWYGQDQQVLSGLLSSMS-EEILHDVVAATTSKEAWDTLQRMFSSSTRAHTTQIRVELATSKKRDQSA

Query:  ANYFRKIKGLATELAAAGSALQDDDVIAYLLAGLGPDYDPFVTSMTTKSEALTLDD
        A+Y+RK+K LA  L      + D +++ Y+L GL P +D  +  +  +    + DD
Subjt:  ANYFRKIKGLATELAAAGSALQDDDVIAYLLAGLGPDYDPFVTSMTTKSEALTLDD

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.9e-0635.48Show/hide
Query:  TYLVARSSASEPTSHITAMEHPLWRQAMNDEFQALQKNKTWHLVPPRAGLNVIDCKWVFKLK
        ++LV  + A EP+++  A E  +W  AM+DE  A++   TW +         I CKWV+K+K
Subjt:  TYLVARSSASEPTSHITAMEHPLWRQAMNDEFQALQKNKTWHLVPPRAGLNVIDCKWVFKLK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.1e-1246.25Show/hide
Query:  TRLKHNIKKLKVCTDETVTYLVARSSASEPTSHITAMEHPLWRQAMNDEFQALQKNKTWHLVPPRAGLNVIDCKWVFKLK
        TR K  I KL      T+T  + +    EP S I A++ P W QAM +E  AL +NKTW LVPP    N++ CKWVFK K
Subjt:  TRLKHNIKKLKVCTDETVTYLVARSSASEPTSHITAMEHPLWRQAMNDEFQALQKNKTWHLVPPRAGLNVIDCKWVFKLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCTACCTCGATGGCACCAATGCCGCACCTGCCAAGCTGGTCCCTTCCTCAACTGCTGCTGGTGCTGAGCTGGTCTCTAATCCGGCCTATAATCAGTGGTATGGTCA
GGATCAACAGGTCCTTAGCGGTCTTCTCTCCTCCATGTCTGAGGAGATTCTTCATGATGTGGTTGCCGCTACTACGTCCAAGGAGGCGTGGGATACCCTGCAGCGGATGT
TTTCATCGTCAACTCGTGCTCACACTACTCAGATTCGTGTTGAGCTCGCCACGTCCAAGAAACGAGATCAATCTGCTGCAAATTATTTTCGCAAGATCAAAGGACTGGCC
ACCGAGCTGGCTGCCGCCGGCTCTGCCTTGCAGGATGATGATGTGATCGCGTATCTTCTCGCTGGTCTTGGCCCTGACTATGATCCCTTCGTCACCTCAATGACTACCAA
GAGTGAAGCCCTCACGCTTGATGATGTGTTTGCACATCTAATGACTCGTGGTGGTCAGCAGAAGAATCATGGGCGTAGGGATCGTGGTCGTGGTCGTTCTCAAGGTTATG
CACCCTCTTGTCCTGTTGGTGATCGTCGTGGCCCTTCTGCTCGTCTTTCCTGCCAGATCTGCGACAAAGTGGGGCATACTGCTATCCGCTGCTGGCATAGGATGGATGAG
TCCTATCAAGATGAACCTTCTTCTGCTCCTCCTACGACACTGGCGGCTACTTCCTCTTACAAGATTGACCCAAATTGGTACAGCGACACAGGCGCTACGGACCATATCAC
CAGTGACCTGGATCGTCTCGCTGTGCGTGAACGTTATCATGGAGGTGAACAAGTTCAAGTCGGCAATGGAGCAGGCTACAGTTCTTTACATGAAGGGTATAAGTGTCTTG
ACACCAATTCTGGTTGTGTCTATATGTCTAGAGATGTCATATTTGATGAAAATGTTTTTCCATTCAAGAGAGCCCCACCTAATTCTTCCCCAATCATGCAGTCGACGCAT
AATGCCCCTAATTTGTGCACCTTGCATTTGAGTAATAGCAGTACTAATTTGGAGAATGATCACATGCTTATGTCTGGGCCTACTAACTCTTTGGATGCAGAAATTTTGGT
GTCTACATCAGCTTCGGAATTGCCGCAGCAATCCTCCGCGTTGCTGCCATGCGAATCGGCGTTGGTTGTTCCGCCAATGATTGAAGCCTCGGCTCCTCCGCCAGCAGATG
ATATTGTGCAATGCCCGGTCGAATCCTCGGCCGCTGGTCAACCAACTACTGTAGCAACGGTTGTCCCCTCAAATGTGGATCCTGCATCTACTACTCATCCGTATGGTACT
CGATTGAAGCACAATATCAAGAAACTCAAGGTGTGTACAGATGAAACAGTAACATATCTTGTAGCTCGGTCTTCTGCCTCTGAACCTACTTCACATATTACTGCTATGGA
GCATCCCCTCTGGCGTCAGGCAATGAATGATGAATTTCAGGCACTTCAAAAAAATAAGACATGGCACTTAGTTCCTCCTCGTGCTGGTCTTAACGTTATTGATTGCAAAT
GGGTTTTCAAACTCAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGCTACCTCGATGGCACCAATGCCGCACCTGCCAAGCTGGTCCCTTCCTCAACTGCTGCTGGTGCTGAGCTGGTCTCTAATCCGGCCTATAATCAGTGGTATGGTCA
GGATCAACAGGTCCTTAGCGGTCTTCTCTCCTCCATGTCTGAGGAGATTCTTCATGATGTGGTTGCCGCTACTACGTCCAAGGAGGCGTGGGATACCCTGCAGCGGATGT
TTTCATCGTCAACTCGTGCTCACACTACTCAGATTCGTGTTGAGCTCGCCACGTCCAAGAAACGAGATCAATCTGCTGCAAATTATTTTCGCAAGATCAAAGGACTGGCC
ACCGAGCTGGCTGCCGCCGGCTCTGCCTTGCAGGATGATGATGTGATCGCGTATCTTCTCGCTGGTCTTGGCCCTGACTATGATCCCTTCGTCACCTCAATGACTACCAA
GAGTGAAGCCCTCACGCTTGATGATGTGTTTGCACATCTAATGACTCGTGGTGGTCAGCAGAAGAATCATGGGCGTAGGGATCGTGGTCGTGGTCGTTCTCAAGGTTATG
CACCCTCTTGTCCTGTTGGTGATCGTCGTGGCCCTTCTGCTCGTCTTTCCTGCCAGATCTGCGACAAAGTGGGGCATACTGCTATCCGCTGCTGGCATAGGATGGATGAG
TCCTATCAAGATGAACCTTCTTCTGCTCCTCCTACGACACTGGCGGCTACTTCCTCTTACAAGATTGACCCAAATTGGTACAGCGACACAGGCGCTACGGACCATATCAC
CAGTGACCTGGATCGTCTCGCTGTGCGTGAACGTTATCATGGAGGTGAACAAGTTCAAGTCGGCAATGGAGCAGGCTACAGTTCTTTACATGAAGGGTATAAGTGTCTTG
ACACCAATTCTGGTTGTGTCTATATGTCTAGAGATGTCATATTTGATGAAAATGTTTTTCCATTCAAGAGAGCCCCACCTAATTCTTCCCCAATCATGCAGTCGACGCAT
AATGCCCCTAATTTGTGCACCTTGCATTTGAGTAATAGCAGTACTAATTTGGAGAATGATCACATGCTTATGTCTGGGCCTACTAACTCTTTGGATGCAGAAATTTTGGT
GTCTACATCAGCTTCGGAATTGCCGCAGCAATCCTCCGCGTTGCTGCCATGCGAATCGGCGTTGGTTGTTCCGCCAATGATTGAAGCCTCGGCTCCTCCGCCAGCAGATG
ATATTGTGCAATGCCCGGTCGAATCCTCGGCCGCTGGTCAACCAACTACTGTAGCAACGGTTGTCCCCTCAAATGTGGATCCTGCATCTACTACTCATCCGTATGGTACT
CGATTGAAGCACAATATCAAGAAACTCAAGGTGTGTACAGATGAAACAGTAACATATCTTGTAGCTCGGTCTTCTGCCTCTGAACCTACTTCACATATTACTGCTATGGA
GCATCCCCTCTGGCGTCAGGCAATGAATGATGAATTTCAGGCACTTCAAAAAAATAAGACATGGCACTTAGTTCCTCCTCGTGCTGGTCTTAACGTTATTGATTGCAAAT
GGGTTTTCAAACTCAAGTAA
Protein sequenceShow/hide protein sequence
MGYLDGTNAAPAKLVPSSTAAGAELVSNPAYNQWYGQDQQVLSGLLSSMSEEILHDVVAATTSKEAWDTLQRMFSSSTRAHTTQIRVELATSKKRDQSAANYFRKIKGLA
TELAAAGSALQDDDVIAYLLAGLGPDYDPFVTSMTTKSEALTLDDVFAHLMTRGGQQKNHGRRDRGRGRSQGYAPSCPVGDRRGPSARLSCQICDKVGHTAIRCWHRMDE
SYQDEPSSAPPTTLAATSSYKIDPNWYSDTGATDHITSDLDRLAVRERYHGGEQVQVGNGAGYSSLHEGYKCLDTNSGCVYMSRDVIFDENVFPFKRAPPNSSPIMQSTH
NAPNLCTLHLSNSSTNLENDHMLMSGPTNSLDAEILVSTSASELPQQSSALLPCESALVVPPMIEASAPPPADDIVQCPVESSAAGQPTTVATVVPSNVDPASTTHPYGT
RLKHNIKKLKVCTDETVTYLVARSSASEPTSHITAMEHPLWRQAMNDEFQALQKNKTWHLVPPRAGLNVIDCKWVFKLK