CuGenDBv2

Cmc02g0050171 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc02g0050171
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Genome location: CMiso1.1chr02:16194863..16195732
RNA-Seq Expression: Cmc02g0050171
Synteny: Cmc02g0050171
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR025724 - GAG-pre-integrase domain
  IPR036397 - Ribonuclease H superfamily
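
The genome location string above can be split into a sequence ID and a coordinate range. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the names `Location` and `parse_location` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' genome location."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, see lead-in).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location of the form 'seqid:start..end'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(seqid, start, end)


loc = parse_location("CMiso1.1chr02:16194863..16195732")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the span is 870 bp, a multiple of three, which is what one would expect if the locus corresponds to a single uninterrupted coding sequence.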


Homology
GenBank top hits (E-value, % identity, alignment)
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] (E-value: 3.2e-150; identity: 91.35%)
Query:  MFRIANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEY
        MFR ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKN LLN+L+D SLPPCESCLEGKMTKRPFTGKGYRAKE LELIHSDLCGPMNVKARGGFEY
Subjt:  MFRIANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEY

Query:  FISSIDDYSRYGYLYLMEHKSEALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSY
        FIS IDDYSRYGYLYLMEHKSEALEKFKEYK EVE LLSK+IKILRSDR GEYMDLRFQDYMIEHGIQ QLSA GTPQQNGVSERRNRTLLD+VRSMMSY
Subjt:  FISSIDDYSRYGYLYLMEHKSEALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSY

Query:  AQLASSFWGYAVETAVHILNNVPSKSVSVTPFELWRGRKPSLSHFRIWSCPAHMLVTNLKKMEPHSRLCQFVGCHKETRGGIFFDLQEN
        AQL SSFWGYAVETAVHILNNVPSKSVS TPFELWRGRKPSLSHFRIW CPAH+LVTN KK+EP SRLCQFVG  KETRGG+FFD QEN
Subjt:  AQLASSFWGYAVETAVHILNNVPSKSVSVTPFELWRGRKPSLSHFRIWSCPAHMLVTNLKKMEPHSRLCQFVGCHKETRGGIFFDLQEN

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa] (E-value: 4.3e-147; identity: 89.62%)
Query:  MFRIANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEY
        MFR ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVK+ LLN+L+D SLPPCESCLEGKMTKRPFTGKGYRAKE LELIHSDLCGPMNVKARG FEY
Subjt:  MFRIANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEY

Query:  FISSIDDYSRYGYLYLMEHKSEALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSY
        FIS IDDYSRYGYLYLMEHKSEALEKFKEYK EVE LLSK+IKI RSDR GEYMDL FQDYMIEHGIQ QLSA GTPQQNGVSERRNRTLLD+VRSMMSY
Subjt:  FISSIDDYSRYGYLYLMEHKSEALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSY

Query:  AQLASSFWGYAVETAVHILNNVPSKSVSVTPFELWRGRKPSLSHFRIWSCPAHMLVTNLKKMEPHSRLCQFVGCHKETRGGIFFDLQEN
        AQL SSFWGYAVETAVHILNNVPSKSVS TPFELWRGRKPSLSHFRIW CPAH+LVTN KK+EP SRLCQFVG  KETRGG+FFD +EN
Subjt:  AQLASSFWGYAVETAVHILNNVPSKSVSVTPFELWRGRKPSLSHFRIWSCPAHMLVTNLKKMEPHSRLCQFVGCHKETRGGIFFDLQEN

KAA0043389.1 gag/pol protein [Cucumis melo var. makuwa] (E-value: 1.6e-149; identity: 98.11%)
Query:  RLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEYFISSIDDYSRYGYLYLMEHKSEALE
        +LGHINLDRIGRLVKN LLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEYFISSIDDYSRYGYLYLMEHKSEALE
Subjt:  RLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEYFISSIDDYSRYGYLYLMEHKSEALE

Query:  KFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSYAQLASSFWGYAVETAVHILNNVPSK
        KFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSE+RNRTLLDIV SMMSYAQLASSFWGYAVETAVHILNNVPSK
Subjt:  KFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSYAQLASSFWGYAVETAVHILNNVPSK

Query:  SVSVTPFELWRGRKPSLSHFRIWSCPAHMLVTNLKKMEPHSRLCQFVGCHKETRGGIFFDLQEN
        SVSVTPFELWRGRKPSLSHFRIWSCPAH+LVTNLKKMEPHSRLCQFVGCHKETRGGIFFDLQEN
Subjt:  SVSVTPFELWRGRKPSLSHFRIWSCPAHMLVTNLKKMEPHSRLCQFVGCHKETRGGIFFDLQEN

KAA0060534.1 gag/pol protein [Cucumis melo var. makuwa] (E-value: 6.3e-146; identity: 88.93%)
Query:  MFRIANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEY
        MFR ANTQNKRQRIS NNNTYLWHLRLGHINLDRIGRLVKN LLN+LEDDSLPPCESCLEGKMTKRPFTGKGYRAKE LELIHSDLCGPMNVKA GGFEY
Subjt:  MFRIANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEY

Query:  FISSIDDYSRYGYLYLMEHKSEALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSY
        FIS IDDYS YGYLYL+EHKSEALEKFKEYK EVE LLSK+IKILRSDR GEYMDLRFQDYMIEHGIQ QLSA GTPQQNGVSERRNRTLLD+V SMMSY
Subjt:  FISSIDDYSRYGYLYLMEHKSEALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSY

Query:  AQLASSFWGYAVETAVHILNNVPSKSVSVTPFELWRGRKPSLSHFRIWSCPAHMLVTNLKKMEPHSRLCQFVGCHKETRGGIFFDLQEN
         QL SSFWGYAVETAVHILNNVPSK+V  TPFELWRGRKPSLSHFRIW CP H+LVTN KK+EP SRLCQFVG  KETRGG+FFD QEN
Subjt:  AQLASSFWGYAVETAVHILNNVPSKSVSVTPFELWRGRKPSLSHFRIWSCPAHMLVTNLKKMEPHSRLCQFVGCHKETRGGIFFDLQEN

KAA0065386.1 gag/pol protein [Cucumis melo var. makuwa] (E-value: 3.8e-143; identity: 87.89%)
Query:  MFRIANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEY
        MFR ANTQNKRQRISPNN TYLWHLRLGHINLD+IGRLVKN LLN+LEDDSLPPCES LEGKMTKRPF GKGYRAKE LELIHSDL GPMNVKAR GFEY
Subjt:  MFRIANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEY

Query:  FISSIDDYSRYGYLYLMEHKSEALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSY
        FIS IDDYSRYGYLYLMEHKSEALEK KEY+ EVE LLS++IKILRSDR GEYMDLRFQDYMIEHGIQ QLSA GTPQQNGVSERRNRTLLD+VRSMMSY
Subjt:  FISSIDDYSRYGYLYLMEHKSEALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSY

Query:  AQLASSFWGYAVETAVHILNNVPSKSVSVTPFELWRGRKPSLSHFRIWSCPAHMLVTNLKKMEPHSRLCQFVGCHKETRGGIFFDLQEN
        AQ  SSFWGYAVETAVHILNNVPSKSVS  PFELWRGRKPSLSHFRIW CP HMLVTN KK+EP SRLCQFVG  K+TRGG+FFD QEN
Subjt:  AQLASSFWGYAVETAVHILNNVPSKSVSVTPFELWRGRKPSLSHFRIWSCPAHMLVTNLKKMEPHSRLCQFVGCHKETRGGIFFDLQEN

TrEMBL top hits (E-value, % identity, alignment)
A0A5A7T2V9 Gag/pol protein (E-value: 2.1e-147; identity: 89.62%)
Query:  MFRIANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEY
        MFR ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVK+ LLN+L+D SLPPCESCLEGKMTKRPFTGKGYRAKE LELIHSDLCGPMNVKARG FEY
Subjt:  MFRIANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEY

Query:  FISSIDDYSRYGYLYLMEHKSEALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSY
        FIS IDDYSRYGYLYLMEHKSEALEKFKEYK EVE LLSK+IKI RSDR GEYMDL FQDYMIEHGIQ QLSA GTPQQNGVSERRNRTLLD+VRSMMSY
Subjt:  FISSIDDYSRYGYLYLMEHKSEALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSY

Query:  AQLASSFWGYAVETAVHILNNVPSKSVSVTPFELWRGRKPSLSHFRIWSCPAHMLVTNLKKMEPHSRLCQFVGCHKETRGGIFFDLQEN
        AQL SSFWGYAVETAVHILNNVPSKSVS TPFELWRGRKPSLSHFRIW CPAH+LVTN KK+EP SRLCQFVG  KETRGG+FFD +EN
Subjt:  AQLASSFWGYAVETAVHILNNVPSKSVSVTPFELWRGRKPSLSHFRIWSCPAHMLVTNLKKMEPHSRLCQFVGCHKETRGGIFFDLQEN

A0A5A7TQA5 Gag/pol protein (E-value: 7.7e-150; identity: 98.11%)
Query:  RLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEYFISSIDDYSRYGYLYLMEHKSEALE
        +LGHINLDRIGRLVKN LLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEYFISSIDDYSRYGYLYLMEHKSEALE
Subjt:  RLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEYFISSIDDYSRYGYLYLMEHKSEALE

Query:  KFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSYAQLASSFWGYAVETAVHILNNVPSK
        KFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSE+RNRTLLDIV SMMSYAQLASSFWGYAVETAVHILNNVPSK
Subjt:  KFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSYAQLASSFWGYAVETAVHILNNVPSK

Query:  SVSVTPFELWRGRKPSLSHFRIWSCPAHMLVTNLKKMEPHSRLCQFVGCHKETRGGIFFDLQEN
        SVSVTPFELWRGRKPSLSHFRIWSCPAH+LVTNLKKMEPHSRLCQFVGCHKETRGGIFFDLQEN
Subjt:  SVSVTPFELWRGRKPSLSHFRIWSCPAHMLVTNLKKMEPHSRLCQFVGCHKETRGGIFFDLQEN

A0A5A7TZD0 Gag/pol protein (E-value: 1.6e-150; identity: 91.35%)
Query:  MFRIANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEY
        MFR ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKN LLN+L+D SLPPCESCLEGKMTKRPFTGKGYRAKE LELIHSDLCGPMNVKARGGFEY
Subjt:  MFRIANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEY

Query:  FISSIDDYSRYGYLYLMEHKSEALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSY
        FIS IDDYSRYGYLYLMEHKSEALEKFKEYK EVE LLSK+IKILRSDR GEYMDLRFQDYMIEHGIQ QLSA GTPQQNGVSERRNRTLLD+VRSMMSY
Subjt:  FISSIDDYSRYGYLYLMEHKSEALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSY

Query:  AQLASSFWGYAVETAVHILNNVPSKSVSVTPFELWRGRKPSLSHFRIWSCPAHMLVTNLKKMEPHSRLCQFVGCHKETRGGIFFDLQEN
        AQL SSFWGYAVETAVHILNNVPSKSVS TPFELWRGRKPSLSHFRIW CPAH+LVTN KK+EP SRLCQFVG  KETRGG+FFD QEN
Subjt:  AQLASSFWGYAVETAVHILNNVPSKSVSVTPFELWRGRKPSLSHFRIWSCPAHMLVTNLKKMEPHSRLCQFVGCHKETRGGIFFDLQEN

A0A5A7VGC7 Gag/pol protein (E-value: 1.8e-143; identity: 87.89%)
Query:  MFRIANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEY
        MFR ANTQNKRQRISPNN TYLWHLRLGHINLD+IGRLVKN LLN+LEDDSLPPCES LEGKMTKRPF GKGYRAKE LELIHSDL GPMNVKAR GFEY
Subjt:  MFRIANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEY

Query:  FISSIDDYSRYGYLYLMEHKSEALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSY
        FIS IDDYSRYGYLYLMEHKSEALEK KEY+ EVE LLS++IKILRSDR GEYMDLRFQDYMIEHGIQ QLSA GTPQQNGVSERRNRTLLD+VRSMMSY
Subjt:  FISSIDDYSRYGYLYLMEHKSEALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSY

Query:  AQLASSFWGYAVETAVHILNNVPSKSVSVTPFELWRGRKPSLSHFRIWSCPAHMLVTNLKKMEPHSRLCQFVGCHKETRGGIFFDLQEN
        AQ  SSFWGYAVETAVHILNNVPSKSVS  PFELWRGRKPSLSHFRIW CP HMLVTN KK+EP SRLCQFVG  K+TRGG+FFD QEN
Subjt:  AQLASSFWGYAVETAVHILNNVPSKSVSVTPFELWRGRKPSLSHFRIWSCPAHMLVTNLKKMEPHSRLCQFVGCHKETRGGIFFDLQEN

A0A5D3BNE1 Gag/pol protein (E-value: 3.0e-146; identity: 88.93%)
Query:  MFRIANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEY
        MFR ANTQNKRQRIS NNNTYLWHLRLGHINLDRIGRLVKN LLN+LEDDSLPPCESCLEGKMTKRPFTGKGYRAKE LELIHSDLCGPMNVKA GGFEY
Subjt:  MFRIANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEY

Query:  FISSIDDYSRYGYLYLMEHKSEALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSY
        FIS IDDYS YGYLYL+EHKSEALEKFKEYK EVE LLSK+IKILRSDR GEYMDLRFQDYMIEHGIQ QLSA GTPQQNGVSERRNRTLLD+V SMMSY
Subjt:  FISSIDDYSRYGYLYLMEHKSEALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSY

Query:  AQLASSFWGYAVETAVHILNNVPSKSVSVTPFELWRGRKPSLSHFRIWSCPAHMLVTNLKKMEPHSRLCQFVGCHKETRGGIFFDLQEN
         QL SSFWGYAVETAVHILNNVPSK+V  TPFELWRGRKPSLSHFRIW CP H+LVTN KK+EP SRLCQFVG  KETRGG+FFD QEN
Subjt:  AQLASSFWGYAVETAVHILNNVPSKSVSVTPFELWRGRKPSLSHFRIWSCPAHMLVTNLKKMEPHSRLCQFVGCHKETRGGIFFDLQEN

SwissProt top hits (E-value, % identity, alignment)
P04146 Copia protein (E-value: 1.6e-40; identity: 34.81%)
Query:  NNTYLWHLRLGHIN------LDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLE----LIHSDLCGPMNVKARGGFEYFISSIDD
        NN  LWH R GHI+      + R        LLN LE  S   CE CL GK  + PF  K  + K H++    ++HSD+CGP+         YF+  +D 
Subjt:  NNTYLWHLRLGHIN------LDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLE----LIHSDLCGPMNVKARGGFEYFISSIDD

Query:  YSRYGYLYLMEHKSEALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSYAQLASSF
        ++ Y   YL+++KS+    F+++ A+ E   + ++  L  D   EY+    + + ++ GI + L+   TPQ NGVSER  RT+ +  R+M+S A+L  SF
Subjt:  YSRYGYLYLMEHKSEALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSYAQLASSF

Query:  WGYAVETAVHILNNVPSKSV---SVTPFELWRGRKPSLSHFRIWSCPAHMLVTNLK-KMEPHSRLCQFVG
        WG AV TA +++N +PS+++   S TP+E+W  +KP L H R++    ++ + N + K +  S    FVG
Subjt:  WGYAVETAVHILNNVPSKSV---SVTPFELWRGRKPSLSHFRIWSCPAHMLVTNLK-KMEPHSRLCQFVG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (E-value: 3.3e-49; identity: 36.7%)
Query:  LWHLRLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEYFISSIDDYSRYGYLYLMEHKS
        LWH R+GH++   +  L K  L++  +  ++ PC+ CL GK  +  F     R    L+L++SD+CGPM +++ GG +YF++ IDD SR  ++Y+++ K 
Subjt:  LWHLRLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEYFISSIDDYSRYGYLYLMEHKS

Query:  EALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSYAQLASSFWGYAVETAVHILNN
        +  + F+++ A VE    +++K LRSD  GEY    F++Y   HGI+ + +  GTPQ NGV+ER NRT+++ VRSM+  A+L  SFWG AV+TA +++N 
Subjt:  EALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSYAQLASSFWGYAVETAVHILNN

Query:  VPSKSVSV-TPFELWRGRKPSLSHFRIWSCP--AHMLVTNLKKMEPHSRLCQFVGCHKETRGGIFFD
         PS  ++   P  +W  ++ S SH +++ C   AH+      K++  S  C F+G   E  G   +D
Subjt:  VPSKSVSV-TPFELWRGRKPSLSHFRIWSCP--AHMLVTNLKKMEPHSRLCQFVGCHKETRGGIFFD

Q12491 Transposon Ty2-B Gag-Pol polyprotein (E-value: 2.2e-16; identity: 28.63%)
Query:  NTQNKRQRISPNNNTY-LWHLRLGHINLDRIGRLVKNELLNELEDDSLP-------PCESCLEGKMTKRPFTGKGYRAK-----EHLELIHSDLCGPMNV
        N  NK +  S N   Y L H  LGH N   I + +K   +  L++  +         C  CL GK TK     KG R K     E  + +H+D+ GP++ 
Subjt:  NTQNKRQRISPNNNTY-LWHLRLGHINLDRIGRLVKNELLNELEDDSLP-------PCESCLEGKMTKRPFTGKGYRAK-----EHLELIHSDLCGPMNV

Query:  KARGGFEYFISSIDDYSRYGYLYLMEHKSE--ALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTL
          +    YFIS  D+ +R+ ++Y +  + E   L  F    A ++   + R+ +++ DR  EY +     +    GI    +     + +GV+ER NRTL
Subjt:  KARGGFEYFISSIDDYSRYGYLYLMEHKSE--ALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTL

Query:  LDIVRSMMSYAQLASSFWGYAVETAVHILNNVPS
        L+  R+++  + L +  W  AVE +  I N++ S
Subjt:  LDIVRSMMSYAQLASSFWGYAVETAVHILNNVPS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (E-value: 3.4e-30; identity: 29.41%)
Query:  WHLRLGHINLDRIGRLVKNELLNELE-DDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEYFISSIDDYSRYGYLYLMEHKS
        WH RLGH     +  ++ N  L+ L        C  CL  K  K PF+     +   LE I+SD+     + +   + Y++  +D ++RY +LY ++ KS
Subjt:  WHLRLGHINLDRIGRLVKNELLNELE-DDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEYFISSIDDYSRYGYLYLMEHKS

Query:  EALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSYAQLASSFWGYAVETAVHILNN
        +  E F  +K  +E     RI    SD  GE++ L   +Y  +HGI    S   TP+ NG+SER++R +++   +++S+A +  ++W YA   AV+++N 
Subjt:  EALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSYAQLASSFWGYAVETAVHILNN

Query:  VPSKSVSV-TPFELWRGRKPSLSHFRIWSCPAH--MLVTNLKKMEPHSRLCQFVG
        +P+  + + +PF+   G  P+    R++ C  +  +   N  K++  SR C F+G
Subjt:  VPSKSVSV-TPFELWRGRKPSLSHFRIWSCPAH--MLVTNLKKMEPHSRLCQFVG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (E-value: 1.3e-32; identity: 30.59%)
Query:  WHLRLGHINLDRIGRLVKNELLNELE-DDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEYFISSIDDYSRYGYLYLMEHKS
        WH RLGH +L  +  ++ N  L  L     L  C  C   K  K PF+     + + LE I+SD+     + +   + Y++  +D ++RY +LY ++ KS
Subjt:  WHLRLGHINLDRIGRLVKNELLNELE-DDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEYFISSIDDYSRYGYLYLMEHKS

Query:  EALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSYAQLASSFWGYAVETAVHILNN
        +  + F  +K+ VE     RI  L SD  GE++ LR  DY+ +HGI    S   TP+ NG+SER++R ++++  +++S+A +  ++W YA   AV+++N 
Subjt:  EALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSYAQLASSFWGYAVETAVHILNN

Query:  VPSKSVSV-TPFELWRGRKPSLSHFRIWSCPAH--MLVTNLKKMEPHSRLCQFVG
        +P+  + + +PF+   G+ P+    +++ C  +  +   N  K+E  S+ C F+G
Subjt:  VPSKSVSV-TPFELWRGRKPSLSHFRIWSCPAH--MLVTNLKKMEPHSRLCQFVG

Arabidopsis top hits (E-value, % identity, alignment)
ATMG00300.1 Gag-Pol-related retrotransposon family protein (E-value: 5.5e-07; identity: 36%)
Query:  NNTYLWHLRLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNV
        + T LWH RL H++   +  LVK   L+  +  SL  CE C+ GK  +  F+   +  K  L+ +HSDL G  +V
Subjt:  NNTYLWHLRLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNV

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (E-value: 9.4e-07; identity: 37.68%)
Query:  NRTLLDIVRSMMSYAQLASSFWGYAVETAVHILNNVPSKSVSV-TPFELWRGRKPSLSHFRIWSCPAHM
        NRT+++ VRSM+    L  +F   A  TAVHI+N  PS +++   P E+W    P+ S+ R + C A++
Subjt:  NRTLLDIVRSMMSYAQLASSFWGYAVETAVHILNNVPSKSVSV-TPFELWRGRKPSLSHFRIWSCPAHM
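
The % identity reported for each hit can be cross-checked against the middle line of its alignment. The sketch below assumes the usual BLAST-style convention: letters on the middle line mark identical residues, `+` marks conservative substitutions, spaces mark mismatches, and identity is the number of identical columns divided by the alignment length. The function name `percent_identity` is a hypothetical helper; the query and middle line are copied from the ATMG00300.1 alignment above.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent identity from a BLAST-style middle line.

    Only letters are counted as identities; '+' (similar residue)
    and spaces (mismatch) are ignored.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / aligned_length


# Query and middle line of the ATMG00300.1 alignment on this page.
QUERY = ("NNTYLWHLRLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEH"
         "LELIHSDLCGPMNV")
MIDLINE = ("+ T LWH RL H++   +  LVK   L+  +  SL  CE C+ GK  +  F+   +  K  "
           "L+ +HSDL G  +V")

print(round(percent_identity(MIDLINE, len(QUERY)), 2))
```

Because only letters are counted, the result does not depend on the exact spacing of the middle line. For alignments that contain gaps, the denominator should be the number of alignment columns rather than the ungapped query length.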


Sequences
CDS sequence
ATGTTTAGAATTGCTAATACTCAAAATAAAAGGCAAAGAATTTCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCGATCGGATCGGGAG
GTTGGTAAAGAATGAACTTCTAAACGAGTTAGAAGATGATTCATTACCTCCATGTGAATCTTGTCTTGAAGGAAAAATGACAAAGAGACCTTTTACTGGAAAAGGTTATA
GAGCCAAAGAGCATTTAGAACTTATACATTCAGACCTCTGTGGTCCGATGAATGTAAAAGCTAGAGGGGGTTTTGAATACTTCATCTCTTCTATAGATGATTATTCAAGG
TATGGTTATTTATACTTAATGGAGCATAAGTCTGAAGCTCTTGAAAAGTTCAAGGAGTATAAGGCTGAAGTTGAAATTCTATTAAGTAAAAGGATTAAAATACTTCGATC
TGATCGAGTTGGAGAGTACATGGATTTGAGATTTCAGGACTATATGATAGAACATGGAATCCAATTCCAACTCTCAGCACATGGTACACCTCAACAAAATGGTGTATCAG
AAAGGAGAAATAGAACCTTGTTAGACATCGTTCGTTCAATGATGAGTTACGCTCAATTGGCTAGCTCATTTTGGGGGTATGCAGTAGAGACTGCAGTTCATATCTTGAAC
AATGTTCCCTCGAAGAGTGTTTCTGTAACACCTTTCGAGTTATGGAGAGGACGTAAACCTAGTTTAAGTCATTTCAGAATTTGGAGTTGTCCAGCACACATGTTAGTGAC
AAATCTCAAGAAGATGGAACCTCATTCAAGGTTATGTCAATTTGTTGGTTGTCATAAAGAGACGAGAGGTGGTATATTCTTCGATCTACAAGAAAATTGA
mRNA sequence
ATGTTTAGAATTGCTAATACTCAAAATAAAAGGCAAAGAATTTCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCGATCGGATCGGGAG
GTTGGTAAAGAATGAACTTCTAAACGAGTTAGAAGATGATTCATTACCTCCATGTGAATCTTGTCTTGAAGGAAAAATGACAAAGAGACCTTTTACTGGAAAAGGTTATA
GAGCCAAAGAGCATTTAGAACTTATACATTCAGACCTCTGTGGTCCGATGAATGTAAAAGCTAGAGGGGGTTTTGAATACTTCATCTCTTCTATAGATGATTATTCAAGG
TATGGTTATTTATACTTAATGGAGCATAAGTCTGAAGCTCTTGAAAAGTTCAAGGAGTATAAGGCTGAAGTTGAAATTCTATTAAGTAAAAGGATTAAAATACTTCGATC
TGATCGAGTTGGAGAGTACATGGATTTGAGATTTCAGGACTATATGATAGAACATGGAATCCAATTCCAACTCTCAGCACATGGTACACCTCAACAAAATGGTGTATCAG
AAAGGAGAAATAGAACCTTGTTAGACATCGTTCGTTCAATGATGAGTTACGCTCAATTGGCTAGCTCATTTTGGGGGTATGCAGTAGAGACTGCAGTTCATATCTTGAAC
AATGTTCCCTCGAAGAGTGTTTCTGTAACACCTTTCGAGTTATGGAGAGGACGTAAACCTAGTTTAAGTCATTTCAGAATTTGGAGTTGTCCAGCACACATGTTAGTGAC
AAATCTCAAGAAGATGGAACCTCATTCAAGGTTATGTCAATTTGTTGGTTGTCATAAAGAGACGAGAGGTGGTATATTCTTCGATCTACAAGAAAATTGA
Protein sequence
MFRIANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNELLNELEDDSLPPCESCLEGKMTKRPFTGKGYRAKEHLELIHSDLCGPMNVKARGGFEYFISSIDDYSR
YGYLYLMEHKSEALEKFKEYKAEVEILLSKRIKILRSDRVGEYMDLRFQDYMIEHGIQFQLSAHGTPQQNGVSERRNRTLLDIVRSMMSYAQLASSFWGYAVETAVHILN
NVPSKSVSVTPFELWRGRKPSLSHFRIWSCPAHMLVTNLKKMEPHSRLCQFVGCHKETRGGIFFDLQEN
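
The relationship between the CDS and the protein sequence listed above can be checked by translating the CDS. This is a minimal sketch, assuming the standard genetic code and that the CDS starts in frame at its first base; `translate` is a hypothetical helper written for illustration, not a CuGenDB function.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence in frame 1.

    Whitespace and line breaks are ignored. With to_stop=True,
    translation ends at the first stop codon, which is not emitted.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 33 bases and last 12 bases of the CDS listed on this page.
print(translate("ATGTTTAGAATTGCTAATACTCAAAATAAAAGG"))
print(translate("CAAGAAAATTGA"))
```

The two fragments translate to the first residues and the last three residues of the listed protein sequence, respectively. Pasting the full CDS (line breaks included) into `translate` allows the whole protein to be compared in the same way.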