; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc07g0198071 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc07g0198071
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-Pol polyprotein
Genome locationCMiso1.1chr07:21332001..21332780
RNA-Seq ExpressionCmc07g0198071
SyntenyCmc07g0198071
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0018108 - peptidyl-tyrosine phosphorylation (biological process)
GO:0045489 - pectin biosynthetic process (biological process)
GO:0080090 - regulation of primary metabolic process (biological process)
GO:0060255 - regulation of macromolecule metabolic process (biological process)
GO:0051171 - regulation of nitrogen compound metabolic process (biological process)
GO:0006281 - DNA repair (biological process)
GO:0006413 - translational initiation (biological process)
GO:0006508 - proteolysis (biological process)
GO:0006629 - lipid metabolic process (biological process)
GO:0007018 - microtubule-based movement (biological process)
GO:0048544 - recognition of pollen (biological process)
GO:0016020 - membrane (cellular component)
GO:0016298 - lipase activity (molecular function)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0020037 - heme binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0047262 - polygalacturonate 4-alpha-galacturonosyltransferase activity (molecular function)
GO:0008017 - microtubule binding (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0004714 - transmembrane receptor protein tyrosine kinase activity (molecular function)
GO:0004497 - monooxygenase activity (molecular function)
GO:0004144 - diacylglycerol O-acyltransferase activity (molecular function)
GO:0003777 - microtubule motor activity (molecular function)
GO:0003743 - translation initiation factor activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BAD34493.1 Gag-Pol [Ipomoea batatas]2.9e-10271.81Show/hide
Query:  MKVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRS
        MK+ QGALV+MK  K+ ANLYML+GETLQE EASVA+ S    L  +WH+KLGHMS++G+K+LVE+ L+P LTKVSLP  EHC+TSKQHRLKF+TS+SR 
Subjt:  MKVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRS

Query:  KMILLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT
        K++L LVH DVWQ+ V SLGGA YFV FIDDYS+RCWVYPIKKK++V + FK FK +VEL  GKKIKC RTDNGGEY   EF +FC +EGIKRQFT AYT
Subjt:  KMILLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT

Query:  PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNTVCYIVNRSPSTAIELKTPMQI
        PQQNGVA+RMNRTLLERTRAML A GL+K+FWAE VNT CY+VNR+PSTAIELKTPM++
Subjt:  PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNTVCYIVNRSPSTAIELKTPMQI

KAA0044949.1 hypothetical protein E6C27_scaffold74G002510 [Cucumis melo var. makuwa]4.3e-130100Show/hide
Query:  MKVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRS
        MKVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRS
Subjt:  MKVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRS

Query:  KMILLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT
        KMILLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT
Subjt:  KMILLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT

Query:  PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNT
        PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNT
Subjt:  PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNT

KAE8703216.1 Serine/threonine kinase [Hibiscus syriacus]8.8e-9971.43Show/hide
Query:  MKVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRS
        MK+ +GALV++K  K+ ANLYML+GETL E EASVAS SS  + +M+WH+KLGHMSE+G+KVLVE+ LLP LTKVSLP  EHC+TSKQHRLKFNTS+SR 
Subjt:  MKVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRS

Query:  KMILLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT
        K +L LVH DVWQ+LVTSLGGA YFV FIDDYS+RCWV+PIKKK+ V S FK FK +VEL  G KIKC R DNGGEY   EF +FC +EGIKRQFT A T
Subjt:  KMILLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT

Query:  PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNTVCYIVNRSPSTAIELKTPMQI
        PQQNGVA+ MN+TLLERTRAML   GL+K+FWAE VNT CY+VNR+PSTAIELKTPM++
Subjt:  PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNTVCYIVNRSPSTAIELKTPMQI

KAE8711089.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Hibiscus syriacus]3.6e-10072.59Show/hide
Query:  MKVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRS
        MKV +GALV++K  K+ ANLYML+GETL E EASVAS SS  N +M+WH+KLGHMSE+G+KVLVE+ LLP LTKVSL   EHC+TSKQHRLKFNTS+SR 
Subjt:  MKVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRS

Query:  KMILLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT
        K +L LVH DVWQ+ VTSLGGA YFV FIDDYS+RCWV+PIKKK++V S FK FK +VEL YG KIKC RTDNGGEY   EF +FC +EGIKRQFT A T
Subjt:  KMILLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT

Query:  PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNTVCYIVNRSPSTAIELKTPMQI
         QQNGVA+RMNRTLLERTRAML   GL+K+FWAE VNT CY+VNR+PSTAIELKTPM++
Subjt:  PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNTVCYIVNRSPSTAIELKTPMQI

TYK16527.1 hypothetical protein E5676_scaffold21G003420 [Cucumis melo var. makuwa]1.9e-12294.96Show/hide
Query:  MKVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRS
        MKVIQG LVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSE+GLK  VE+NLLPELTKVSLPF EHCVTSKQHRLKFNTSSSRS
Subjt:  MKVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRS

Query:  KMILLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT
        KMIL LVHYDVWQSLVTSLGGA YFVFFIDDYSKRCWVYPIKKKT+V SVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT
Subjt:  KMILLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT

Query:  PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNT
        PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWA+ VNT
Subjt:  PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNT

TrEMBL top hitse value%identityAlignment
A0A2N9FMR2 Uncharacterized protein1.2e-10472.97Show/hide
Query:  MKVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRS
        MK+++GALV+MK  K+ ANLYML+G+T QEGEAS A +SS E L+MMWHRKLGHMSE+GLK+L E+ LLP L KVSLPF EHCVTSKQHRLKF++SS+RS
Subjt:  MKVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRS

Query:  KMILLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT
        K IL L+H DVWQ+ V SLGGA YFV FIDDYS+RCWVYPIK K +V SVFK+FK +VEL+  KKIKCLRTDNGGEY   EF  FC QEGIKRQFT AYT
Subjt:  KMILLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT

Query:  PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNTVCYIVNRSPSTAIELKTPMQI
        PQQNGVA+RMNRTLLERTRAML   G+ K FWAE V T CY++NRSPSTAI+LKTPM++
Subjt:  PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNTVCYIVNRSPSTAIELKTPMQI

A0A2N9GY85 Glutaredoxin-dependent peroxiredoxin1.2e-10472.97Show/hide
Query:  MKVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRS
        MK+++GALV+MK  K+ ANLYML+G+T QEGEAS A +SS E L+MMWHRKLGHMSE+GLK+L E+ LLP L KVSLPF EHCVTSKQHRLKF++SS+RS
Subjt:  MKVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRS

Query:  KMILLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT
        K IL L+H DVWQ+ V SLGGA YFV FIDDYS+RCWVYPIK K +V SVFK+FK +VEL+  KKIKCLRTDNGGEY   EF  FC QEGIKRQFT AYT
Subjt:  KMILLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT

Query:  PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNTVCYIVNRSPSTAIELKTPMQI
        PQQNGVA+RMNRTLLERTRAML   G+ K FWAE V T CY++NRSPSTAI+LKTPM++
Subjt:  PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNTVCYIVNRSPSTAIELKTPMQI

A0A2N9HLU0 Uncharacterized protein1.2e-10472.97Show/hide
Query:  MKVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRS
        MK+++GALV+MK  K+ ANLYML+G+T QEGEAS A +SS E L+MMWHRKLGHMSE+GLK+L E+ LLP L KVSLPF EHCVTSKQHRLKF++SS+RS
Subjt:  MKVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRS

Query:  KMILLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT
        K IL L+H DVWQ+ V SLGGA YFV FIDDYS+RCWVYPIK K +V SVFK+FK +VEL+  KKIKCLRTDNGGEY   EF  FC QEGIKRQFT AYT
Subjt:  KMILLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT

Query:  PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNTVCYIVNRSPSTAIELKTPMQI
        PQQNGVA+RMNRTLLERTRAML   G+ K FWAE V T CY++NRSPSTAI+LKTPM++
Subjt:  PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNTVCYIVNRSPSTAIELKTPMQI

A0A5A7TUN0 Uncharacterized protein2.1e-130100Show/hide
Query:  MKVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRS
        MKVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRS
Subjt:  MKVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRS

Query:  KMILLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT
        KMILLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT
Subjt:  KMILLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT

Query:  PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNT
        PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNT
Subjt:  PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNT

A0A5D3CXA6 Uncharacterized protein9.4e-12394.96Show/hide
Query:  MKVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRS
        MKVIQG LVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSE+GLK  VE+NLLPELTKVSLPF EHCVTSKQHRLKFNTSSSRS
Subjt:  MKVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRS

Query:  KMILLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT
        KMIL LVHYDVWQSLVTSLGGA YFVFFIDDYSKRCWVYPIKKKT+V SVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT
Subjt:  KMILLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT

Query:  PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNT
        PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWA+ VNT
Subjt:  PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNT

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.0e-3335.96Show/hide
Query:  ENLSMMWHRKLGHMSEKGLKVLVEKNLLPELT-----KVSLPFYEHCVTSKQHRLKFNTSSSRS--KMILLLVHYDVWQSLV-TSLGGASYFVFFIDDYS
        +N   +WH + GH+S+  L  +  KN+  + +     ++S    E C+  KQ RL F     ++  K  L +VH DV   +   +L   +YFV F+D ++
Subjt:  ENLSMMWHRKLGHMSEKGLKVLVEKNLLPELT-----KVSLPFYEHCVTSKQHRLKFNTSSSRS--KMILLLVHYDVWQSLV-TSLGGASYFVFFIDDYS

Query:  KRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYTPQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWA
          C  Y IK K++V S+F+ F  + E  +  K+  L  DNG EY+ NE  +FC ++GI    T  +TPQ NGV++RM RT+ E+ R M+    L K+FW 
Subjt:  KRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYTPQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWA

Query:  EVVNTVCYIVNRSPSTAI--ELKTPMQI
        E V T  Y++NR PS A+    KTP ++
Subjt:  EVVNTVCYIVNRSPSTAI--ELKTPMQI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.6e-5343.24Show/hide
Query:  KVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRSK
        ++ +G+LV+ K       LY    E  Q GE + A      +L   WH+++GHMSEKGL++L +K+L+      ++   ++C+  KQHR+ F TSS R  
Subjt:  KVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRSK

Query:  MILLLVHYDVWQSL-VTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT
         IL LV+ DV   + + S+GG  YFV FIDD S++ WVY +K K  V  VF+ F   VE + G+K+K LR+DNGGEY   EF E+C+  GI+ + T   T
Subjt:  MILLLVHYDVWQSL-VTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYT

Query:  PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNTVCYIVNRSPSTAIELKTPMQI
        PQ NGVA+RMNRT++E+ R+ML    L K+FW E V T CY++NRSPS  +  + P ++
Subjt:  PQQNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNTVCYIVNRSPSTAIELKTPMQI

Q12491 Transposon Ty2-B Gag-Pol polyprotein7.6e-1326.17Show/hide
Query:  HRKLGHMSEKGLKVLVEKNLLPELTKVSLPF-----YE--HCVTSKQHRLKFNTSSSRSKMILLLVHYDVWQSLVTSLGG---------ASYFVFFIDDY
        HR LGH + + ++  ++KN +  L +  + +     Y+   C+  K  + + +   SR K       Y+ +Q L T + G          SYF+ F D+ 
Subjt:  HRKLGHMSEKGLKVLVEKNLLPELTKVSLPF-----YE--HCVTSKQHRLKFNTSSSRSKMILLLVHYDVWQSLVTSLGG---------ASYFVFFIDDY

Query:  SKRCWVYPI--KKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYTPQQNGVAQRMNRTLLERTRAMLGAVGLKKA
        ++  WVYP+  +++ ++ +VF      ++ Q+  ++  ++ D G EY      +F    GI   +T     + +GVA+R+NRTLL   R +L   GL   
Subjt:  SKRCWVYPI--KKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYTPQQNGVAQRMNRTLLERTRAMLGAVGLKKA

Query:  FWAEVVNTVCYIVN
         W   V     I N
Subjt:  FWAEVVNTVCYIVN

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.0e-2626.95Show/hide
Query:  RKVDANLYMLEGETLQE---------GEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFY--EHCVTSKQHRLKFNTSSSRSKM
        + ++  + +L+G+T  E            S+ +S S +     WH +LGH +   L  ++    L  L   S  F     C+ +K +++ F+ S+  S  
Subjt:  RKVDANLYMLEGETLQE---------GEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFY--EHCVTSKQHRLKFNTSSSRSKM

Query:  ILLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYTPQ
         L  ++ DVW S + S     Y+V F+D +++  W+YP+K+K+ V   F  FK  +E ++  +I    +DNGGE++     E+ +Q GI    +  +TP+
Subjt:  ILLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYTPQ

Query:  QNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNTVCYIVNRSPSTAIELKTPMQ
         NG+++R +R ++E    +L    + K +W        Y++NR P+  ++L++P Q
Subjt:  QNGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNTVCYIVNRSPSTAIELKTPMQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.6e-2927.84Show/hide
Query:  RKVDANLYMLEGETLQE---------GEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELT-KVSLPFYEHCVTSKQHRLKFNTSSSRSKMI
        + ++  + +L+G+T  E            S+ +S   +     WH +LGH S   L  ++  + LP L     L     C  +K H++ F+ S+  S   
Subjt:  RKVDANLYMLEGETLQE---------GEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELT-KVSLPFYEHCVTSKQHRLKFNTSSSRSKMI

Query:  LLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYTPQQ
        L  ++ DVW S + S+    Y+V F+D +++  W+YP+K+K+ V   F +FK  VE ++  +I  L +DNGGE++     ++ +Q GI    +  +TP+ 
Subjt:  LLLVHYDVWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYTPQQ

Query:  NGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNTVCYIVNRSPSTAIELKTPMQ
        NG+++R +R ++E    +L    + K +W    +   Y++NR P+  ++L++P Q
Subjt:  NGVAQRMNRTLLERTRAMLGAVGLKKAFWAEVVNTVCYIVNRSPSTAIELKTPMQ

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein1.2e-1335.71Show/hide
Query:  MKVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRS
        +KV++G   ++K  + D +LY+L+G +++ GE+++A ++  E  + +WH +L HMS++G+++LV+K  L      SL F E C+  K HR+ F+T    +
Subjt:  MKVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRS

Query:  KMILLLVHYDVW
        K  L  VH D+W
Subjt:  KMILLLVHYDVW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGTAATCCAAGGTGCGCTAGTACTTATGAAAAGAAGAAAGGTGGATGCAAACTTGTACATGTTAGAGGGAGAAACTTTGCAGGAAGGAGAAGCATCTGTTGCTTC
AAGTAGTTCAGGTGAAAATCTCTCAATGATGTGGCATCGCAAATTAGGCCACATGTCTGAAAAAGGATTAAAAGTTCTTGTAGAGAAAAATCTACTCCCAGAGCTCACTA
AGGTGTCTCTACCCTTTTATGAGCATTGTGTTACAAGCAAGCAACATAGATTGAAGTTCAATACATCAAGTTCTAGAAGTAAAATGATTCTACTACTGGTTCATTATGAT
GTATGGCAATCACTGGTTACATCTCTTGGAGGAGCAAGTTACTTTGTGTTCTTTATAGATGATTATTCTAAAAGGTGTTGGGTGTATCCTATTAAGAAGAAGACAAATGT
ATGTTCTGTCTTCAAAGTATTCAAAGTGCAAGTGGAACTTCAATATGGTAAAAAGATCAAGTGTTTACGTACAGATAATGGAGGAGAATATATAAGAAATGAGTTTGTGG
AATTTTGTAATCAGGAAGGCATTAAAAGACAATTCACTGCTGCTTACACTCCTCAACAAAATGGAGTGGCACAGCGGATGAACAGAACCTTGCTAGAAAGAACAAGAGCA
ATGTTGGGAGCTGTAGGCTTAAAGAAAGCTTTCTGGGCAGAAGTTGTTAATACCGTCTGTTATATAGTGAATCGTTCTCCATCAACTGCAATTGAGTTAAAGACACCAAT
GCAGATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGTAATCCAAGGTGCGCTAGTACTTATGAAAAGAAGAAAGGTGGATGCAAACTTGTACATGTTAGAGGGAGAAACTTTGCAGGAAGGAGAAGCATCTGTTGCTTC
AAGTAGTTCAGGTGAAAATCTCTCAATGATGTGGCATCGCAAATTAGGCCACATGTCTGAAAAAGGATTAAAAGTTCTTGTAGAGAAAAATCTACTCCCAGAGCTCACTA
AGGTGTCTCTACCCTTTTATGAGCATTGTGTTACAAGCAAGCAACATAGATTGAAGTTCAATACATCAAGTTCTAGAAGTAAAATGATTCTACTACTGGTTCATTATGAT
GTATGGCAATCACTGGTTACATCTCTTGGAGGAGCAAGTTACTTTGTGTTCTTTATAGATGATTATTCTAAAAGGTGTTGGGTGTATCCTATTAAGAAGAAGACAAATGT
ATGTTCTGTCTTCAAAGTATTCAAAGTGCAAGTGGAACTTCAATATGGTAAAAAGATCAAGTGTTTACGTACAGATAATGGAGGAGAATATATAAGAAATGAGTTTGTGG
AATTTTGTAATCAGGAAGGCATTAAAAGACAATTCACTGCTGCTTACACTCCTCAACAAAATGGAGTGGCACAGCGGATGAACAGAACCTTGCTAGAAAGAACAAGAGCA
ATGTTGGGAGCTGTAGGCTTAAAGAAAGCTTTCTGGGCAGAAGTTGTTAATACCGTCTGTTATATAGTGAATCGTTCTCCATCAACTGCAATTGAGTTAAAGACACCAAT
GCAGATTTGA
Protein sequenceShow/hide protein sequence
MKVIQGALVLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMWHRKLGHMSEKGLKVLVEKNLLPELTKVSLPFYEHCVTSKQHRLKFNTSSSRSKMILLLVHYD
VWQSLVTSLGGASYFVFFIDDYSKRCWVYPIKKKTNVCSVFKVFKVQVELQYGKKIKCLRTDNGGEYIRNEFVEFCNQEGIKRQFTAAYTPQQNGVAQRMNRTLLERTRA
MLGAVGLKKAFWAEVVNTVCYIVNRSPSTAIELKTPMQI