CuGenDBv2

Cmc08g0222841 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc08g0222841
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Integrase catalytic domain-containing protein
Genome location: CMiso1.1chr08:11930981..11931934
RNA-Seq Expression: Cmc08g0222841
Synteny: Cmc08g0222841
Gene Ontology terms:
    GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR001584 - Integrase, catalytic core
    IPR012337 - Ribonuclease H-like superfamily
    IPR025724 - GAG-pre-integrase domain
    IPR036397 - Ribonuclease H superfamily
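
The Genome location field in the record above can be split into a sequence identifier and a coordinate pair. The following is a minimal sketch, assuming the field always has the form `<sequence id>:<start>..<end>` and that the coordinates are 1-based and inclusive; neither assumption is stated on the page, and the names `parse_location` and `span_length` are hypothetical helpers.

```python
import re

# Assumed layout of the "Genome location" field: <sequence id>:<start>..<end>
LOCATION_RE = re.compile(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)")


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a location string into (sequence id, start, end)."""
    match = LOCATION_RE.fullmatch(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("CMiso1.1chr08:11930981..11931934")
print(seqid, span_length(start, end))  # CMiso1.1chr08 954
```

Under these assumptions the gene spans 954 bp, which appears consistent with a 317-residue protein plus one stop codon (318 codons) encoded without introns.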


Homology
GenBank top hits | e value | % identity
KAE8669333.1 hypothetical protein F3Y22_tig00112249pilonHSYRG00290 [Hibiscus syriacus] | 3.9e-117 | 66.78
Query:  EIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEGRRIKSIYVMSVEIA
        E  G RVVV A+NS+LPI  +GKT++ P  N+NQV+L  V++VPGMKKNL+SV+QLTS+ ++V+FGP DVKVY ++K+S TP MEGRR++SIYVMS E A
Subjt:  EIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEGRRIKSIYVMSVEIA

Query:  YVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVFGPVKQSSIGGMRYMVTFID
        YV +T+KNET+DLWH RLGH+ Y+KL  ++ K MLKGLPQLD++ D VCAGCQYGK HQLP+ ESKF+ K+PLELVHSDVFGPVKQ SI GMRYMVTFID
Subjt:  YVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVFGPVKQSSIGGMRYMVTFID

Query:  DFSRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVYRSMLHASNVLGR
        DFSRY W+FFMKEK +T + FKEF++  E E+ K+I CLR+DNRGEY SNEFSQYL++ +I HQ TC NT + NG+AERKNRH  ++ RSMLHA NV GR
Subjt:  DFSRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVYRSMLHASNVLGR

Query:  F
        F
Subjt:  F

KAE8687058.1 hypothetical protein F3Y22_tig00111024pilonHSYRG00006 [Hibiscus syriacus] | 1.8e-117 | 67.44
Query:  EIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEGRRIKSIYVMSVEIA
        E  G RVVV A+NS+LPI  +GKT++ P  N+NQV+L  V++VPGMKKNL+SV+QLTS+ ++V+FGP DVKVY ++K++ TP MEGRR++SIYVMS E A
Subjt:  EIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEGRRIKSIYVMSVEIA

Query:  YVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVFGPVKQSSIGGMRYMVTFID
        YV +T+KNET+DLWH RLGHV Y+KL  ++ K MLKGLPQLD++ D VCAGCQYGKAHQLP+ ESKF+ K+PLELVHSDVFGPVKQ SI GMRYMVTFID
Subjt:  YVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVFGPVKQSSIGGMRYMVTFID

Query:  DFSRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVYRSMLHASNVLGR
        DFSRY WVFFMKEK +T + FKEF++  E E+ K+I CLR+DN GEY SNEFSQYL++ +I HQ TC NT ++NG+AERKNRH A++ RSMLHA NV GR
Subjt:  DFSRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVYRSMLHASNVLGR

Query:  F
        F
Subjt:  F

KAE8705435.1 hypothetical protein F3Y22_tig00110429pilonHSYRG01243 [Hibiscus syriacus] | 1.8e-117 | 67.44
Query:  EIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEGRRIKSIYVMSVEIA
        E  G RVVV A+NS+LPI  +GKT++ P  N+NQV+L  V++VPGMKKNL+SV+QLTS+ ++V+FGP DVKVY ++K++ TP MEGRR++SIYVMS E A
Subjt:  EIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEGRRIKSIYVMSVEIA

Query:  YVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVFGPVKQSSIGGMRYMVTFID
        YV +T+KNET+DLWH RLGHV Y+KL  ++ K MLKGLPQLD++ D VCAGCQYGKAHQLP+ ESKF+ K+PLELVHSDVFGPVKQ SI GMRYMVTFID
Subjt:  YVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVFGPVKQSSIGGMRYMVTFID

Query:  DFSRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVYRSMLHASNVLGR
        DFSRY WVFFMKEK +T + FKEF++  E E+ K+I CLR+DN GEY SNEFSQYL++ +I HQ TC NT ++NG+AERKNRH A++ RSMLHA NV GR
Subjt:  DFSRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVYRSMLHASNVLGR

Query:  F
        F
Subjt:  F

KAE8715296.1 hypothetical protein F3Y22_tig00110183pilonHSYRG00102 [Hibiscus syriacus] | 1.8e-117 | 67.44
Query:  EIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEGRRIKSIYVMSVEIA
        E  G RVVV A+NS+LPI  +GKT++ P  N+NQV+L  V++VPGMKKNL+SV+QLTS+ ++V+FGP DVKVY ++K++ TP MEGRR++SIYVMS E A
Subjt:  EIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEGRRIKSIYVMSVEIA

Query:  YVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVFGPVKQSSIGGMRYMVTFID
        YV +T+KNET+DLWH RLGHV Y+KL  ++ K MLKGLPQLD++ D VCAGCQYGKAHQLP+ ESKF+ K+PLELVHSDVFGPVKQ SI GMRYMVTFID
Subjt:  YVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVFGPVKQSSIGGMRYMVTFID

Query:  DFSRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVYRSMLHASNVLGR
        DFSRY WVFFMKEK +T + FKEF++  E E+ K+I CLR+DN GEY SNEFSQYL++ +I HQ TC NT ++NG+AERKNRH A++ RSMLHA NV GR
Subjt:  DFSRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVYRSMLHASNVLGR

Query:  F
        F
Subjt:  F

TYK27792.1 Integrase, catalytic core [Cucumis melo var. makuwa] | 7.1e-119 | 93.67
Query:  MIGLLIQDVLTIRQEIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEG
        MIGLLIQDVLTIRQEIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEG
Subjt:  MIGLLIQDVLTIRQEIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEG

Query:  RRIKSIYVMSVEIAYVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVFGPVKQ
        RRIKSIYVMSVEIAYVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVFGPVKQ
Subjt:  RRIKSIYVMSVEIAYVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVFGPVKQ

Query:  SSIGGMRYMVTFIDDFSRYAWV-FFMKEKFETITNFK
        SSIGGMRYMVTFIDDFSR        K ++E + N K
Subjt:  SSIGGMRYMVTFIDDFSRYAWV-FFMKEKFETITNFK
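
In the alignment blocks above, the unlabeled middle row follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As extracted here, the Subjt rows are identical to the Query rows, so the middle row is the one that carries the comparison. A minimal sketch of counting identities and positives from one block follows; `midline_stats` is a hypothetical helper, and it assumes the query and midline strings have already been stripped of their `Query:` prefix and leading padding so that the columns line up.

```python
def midline_stats(query: str, midline: str) -> dict[str, float]:
    """Count identities and positives in one BLAST-style alignment block.

    Assumes `query` and `midline` are column-aligned and of equal length.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(ch.isalpha() for ch in midline)  # letters = identical residues
    positives = identities + midline.count("+")       # '+' = conservative substitution
    length = len(query)
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "percent_identity": 100.0 * identities / length if length else 0.0,
    }


# First 18 columns of the first GenBank alignment block above.
stats = midline_stats("EIRGSRVVVIANNSKLPI", "E  G RVVV A+NS+LPI")
print(stats["identities"], stats["positives"], stats["length"])  # 12 14 18
```

The % identity values in the hit tables are computed over the full alignment, so a figure derived from a single block will generally differ from the tabulated one.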

TrEMBL top hits | e value | % identity
A0A2N9EJM7 Uncharacterized protein | 7.0e-120 | 69.1
Query:  EIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEGRRIKSIYVMSVEIA
        E +G RVVV A+NS+LPIA +GKT++ P  NSNQV L  V++VPGMKKNL+SV+QLT + ++V+FGP DVKVY +LK+S TP+MEG+R++S+YVMS E A
Subjt:  EIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEGRRIKSIYVMSVEIA

Query:  YVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVFGPVKQSSIGGMRYMVTFID
        YV KT+KNET DLWH RLGHV Y+KL  ++ K MLKGLPQLD++ D VCAGCQYGKAHQLP+KESKF+ K+PLELVHSDVFGPVKQ SIGGMRYMVTFID
Subjt:  YVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVFGPVKQSSIGGMRYMVTFID

Query:  DFSRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVYRSMLHASNVLGR
        DFSRY WVFFMKEK +T + FKEF+E  E E+ K+I CLR+DN GEY S+EFSQYL++ +I HQ TC NT ++NG+AERKNRH A+V RSMLHA NV GR
Subjt:  DFSRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVYRSMLHASNVLGR

Query:  F
        F
Subjt:  F

A0A2N9F162 Uncharacterized protein | 1.6e-119 | 68.77
Query:  EIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEGRRIKSIYVMSVEIA
        E +G RVVV A+NS+LPIA +GKT++ P  NSNQV L  V++VPGMKKNL+SV+QLT + ++V+FGP DVKVY +LK+S TP+MEG+R++S+YVMS E A
Subjt:  EIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEGRRIKSIYVMSVEIA

Query:  YVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVFGPVKQSSIGGMRYMVTFID
        YV +T+KNET DLWH RLGHV Y+KL  ++ K MLKGLPQLD++ D VCAGCQYGKAHQLP+KESKF+ K+PLELVHSDVFGPVKQ SIGGMRYMVTFID
Subjt:  YVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVFGPVKQSSIGGMRYMVTFID

Query:  DFSRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVYRSMLHASNVLGR
        DFSRY WVFFMKEK +T + FKEF+E  E E+ K+I CLR+DN GEY S+EFSQYL++ +I HQ TC NT ++NG+AERKNRH A+V RSMLHA NV GR
Subjt:  DFSRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVYRSMLHASNVLGR

Query:  F
        F
Subjt:  F

A0A2N9GKM9 Uncharacterized protein | 1.2e-119 | 69.1
Query:  EIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEGRRIKSIYVMSVEIA
        E +G RVVV A+NS+LPIA +GKT++ P  NSNQV L  V++VPGMKKNL+SV+QLT + ++V+FGP DVKVY +LK+S TP+MEG+R++S+YVMS E A
Subjt:  EIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEGRRIKSIYVMSVEIA

Query:  YVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVFGPVKQSSIGGMRYMVTFID
        YV KT+KNET DLWH RLGHV Y+KL  ++ K MLKGLPQLD++ D VCAGCQYGKAHQLP+KESKF+ K+PLELVHSDVFGPVKQ SIGGMRYMVTFID
Subjt:  YVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVFGPVKQSSIGGMRYMVTFID

Query:  DFSRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVYRSMLHASNVLGR
        DFSRY WVFFMKEK +T + FKEF+E  E E+ K+I CLR+DN GEY S+EFSQYL++ +I HQ TC NT ++NG+AERKNRH A+V RSMLHA NV GR
Subjt:  DFSRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVYRSMLHASNVLGR

Query:  F
        F
Subjt:  F

A0A2N9HMH0 Uncharacterized protein | 9.1e-120 | 69.1
Query:  EIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEGRRIKSIYVMSVEIA
        E +G RVVV A+NS+LPIA +GKT++ P  NSNQV L  V++VPGMKKNL+SV+QLT + ++V+FGP DVKVY +LK+S TP+MEG+R++S+YVMS E A
Subjt:  EIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEGRRIKSIYVMSVEIA

Query:  YVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVFGPVKQSSIGGMRYMVTFID
        YV KT+KNET DLWH RLGHV Y+KL  ++ K MLKGLPQLD++ D VCAGCQYGKAHQLP+KESKF+ K+PLELVHSDVFGPVKQ SIGGMRYMVTFID
Subjt:  YVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVFGPVKQSSIGGMRYMVTFID

Query:  DFSRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVYRSMLHASNVLGR
        DFSRY WVFFMKEK +T + FKEF+E  E E+ K+I CLR+DN GEY S+EFSQYL++ +I HQ TC NT ++NG+AERKNRH A+V RSMLHA NV GR
Subjt:  DFSRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVYRSMLHASNVLGR

Query:  F
        F
Subjt:  F

A0A2N9HXV3 Uncharacterized protein | 7.0e-120 | 69.1
Query:  EIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEGRRIKSIYVMSVEIA
        E +G RVVV A+NS+LPIA +GKT++ P  NSNQV L  V++VPGMKKNL+SV+QLT + ++V+FGP DVKVY +LK+S TP+MEG+R++S+YVMS E A
Subjt:  EIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEGRRIKSIYVMSVEIA

Query:  YVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVFGPVKQSSIGGMRYMVTFID
        YV +T+KNET DLWH RLGHV Y+KL  ++ K MLKGLPQLD++ D VCAGCQYGKAHQLP+KESKF+ K+PLELVHSDVFGPVKQ SIGGMRYMVTFID
Subjt:  YVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVFGPVKQSSIGGMRYMVTFID

Query:  DFSRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVYRSMLHASNVLGR
        DFSRY WVFFMKEK +T + FKEF+E VE E+ K+I CLR+DN GEY S+EFSQYL++ +I HQ TC NT ++NG+AERKNRH A+V RSMLHA NV GR
Subjt:  DFSRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVYRSMLHASNVLGR

Query:  F
        F
Subjt:  F

SwissProt top hits | e value | % identity
P04146 Copia protein | 2.1e-28 | 30.3
Query:  KLPIAQVG------KTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHN----LKVSGTPLMEGRRIKSIYVMSVEIAYVKK
        K+ +A+ G      K  IV   N +++ L+ V +      NLMSV +L  A   + F    V +  N    +K SG        + ++ V++ +   +  
Subjt:  KLPIAQVG------KTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHN----LKVSGTPLMEGRRIKSIYVMSVEIAYVKK

Query:  TQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKE--DMVCAGCQYGKAHQLPFKE--SKFRVKQPLELVHSDVFGPVKQSSIGGMRYMVTFID
          KN    LWH R GH+   KL  I  K M      L+  E    +C  C  GK  +LPFK+   K  +K+PL +VHSDV GP+   ++    Y V F+D
Subjt:  TQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKE--DMVCAGCQYGKAHQLPFKE--SKFRVKQPLELVHSDVFGPVKQSSIGGMRYMVTFID

Query:  DFSRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVYRSMLHASNV
         F+ Y   + +K K +  + F++F  + E     ++  L  DN  EY+SNE  Q+  K  I + LT P+T + NG++ER  R   +  R+M+  + +
Subjt:  DFSRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVYRSMLHASNV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 6.7e-35 | 33.56
Query:  VVIANNSKLPIAQVGKTMI---VPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEGRRIKSIYVMSVEIAY--V
        V + N S   IA +G   I   V C+    + L  V +VP ++ NL+S   L        F     ++     V    +  G    ++Y  + EI    +
Subjt:  VVIANNSKLPIAQVGKTMI---VPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEGRRIKSIYVMSVEIAY--V

Query:  KKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVFGPVKQSSIGGMRYMVTFIDDF
           Q   + DLWH R+GH+    L+ +  K ++       +K    C  C +GK H++ F+ S  R    L+LV+SDV GP++  S+GG +Y VTFIDD 
Subjt:  KKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVFGPVKQSSIGGMRYMVTFIDDF

Query:  SRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVYRSMLHASNV
        SR  WV+ +K K +    F++F   VE E  ++++ LRSDN GEY S EF +Y   + I H+ T P T + NG+AER NR   +  RSML  + +
Subjt:  SRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVYRSMLHASNV

Q12501 Transposon Ty2-OR2 Gag-Pol polyprotein | 3.6e-12 | 22.08
Query:  VVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEGRRIKSIYVMS--------VEI
        +V A    +PI  +G  +     N  +  +    + P +  +L+S+S+LT+ +    F  + ++     +  GT L    +    Y +S        +  
Subjt:  VVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEGRRIKSIYVMS--------VEI

Query:  AYVKKTQKNETAD-----LWHARLGHVIYNKLKTIINKFMLKGLPQLDIK----EDMVCAGCQYGKA--------HQLPFKESKFRVKQPLELVHSDVFG
          +    K+++ +     L H  LGH  +  ++  + K  +  L + DI+        C  C  GK+         +L ++ES     +P + +H+D+FG
Subjt:  AYVKKTQKNETAD-----LWHARLGHVIYNKLKTIINKFMLKGLPQLDIK----EDMVCAGCQYGKA--------HQLPFKESKFRVKQPLELVHSDVFG

Query:  PVKQSSIGGMRYMVTFIDDFSRYAWVFFMKE-KFETITN-FKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERK
        PV         Y ++F D+ +R+ WV+ + + + E+I N F      ++N+   R+  ++ D   EY +    ++     I    T    S  +G+AER 
Subjt:  PVKQSSIGGMRYMVTFIDDFSRYAWVFFMKE-KFETITN-FKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERK

Query:  NRHPAKVYRSMLHASNV
        NR      R++LH S +
Subjt:  NRHPAKVYRSMLHASNV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 7.9e-36 | 32.57
Query:  LTIRQEIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNF-VVFGPDDVKVYHNLKVSGTPLMEGRRIKSIY-
        L++ Q   G   V++A+ S +PI+  G T +   + S  + L ++ YVP + KNL+SV +L +A+   V F P   +V  +L  +G PL++G+    +Y 
Subjt:  LTIRQEIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNF-VVFGPDDVKVYHNLKVSGTPLMEGRRIKSIY-

Query:  ---VMSVEIAYVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVF-GPVKQSSI
             S  ++         T   WHARLGH   + L ++I+ + L  L      + + C+ C   K++++PF +S     +PLE ++SDV+  P+   S 
Subjt:  ---VMSVEIAYVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVF-GPVKQSSI

Query:  GGMRYMVTFIDDFSRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVYR
           RY V F+D F+RY W++ +K+K +    F  FK  +EN  + RI    SDN GE+++    +Y  ++ I H  + P+T E NGL+ERK+RH  +   
Subjt:  GGMRYMVTFIDDFSRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVYR

Query:  SML-HAS
        ++L HAS
Subjt:  SML-HAS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 6.0e-36 | 34.09
Query:  LTIRQEIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNF-VVFGPDDVKVYHNLKVSGTPLMEGRRIKSIY-
        L+  Q   G   V+IA+ S +PI   G   +   ++S  ++L+ V YVP + KNL+SV +L + +   V F P   +V  +L  +G PL++G+    +Y 
Subjt:  LTIRQEIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNF-VVFGPDDVKVYHNLKVSGTPLMEGRRIKSIY-

Query:  ---VMSVEIAYVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMV-CAGCQYGKAHQLPFKESKFRVKQPLELVHSDVF-GPVKQSS
             S  ++         T   WH+RLGH     L ++I+      LP L+    ++ C+ C   K+H++PF  S     +PLE ++SDV+  P+   S
Subjt:  ---VMSVEIAYVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMV-CAGCQYGKAHQLPFKESKFRVKQPLELVHSDVF-GPVKQSS

Query:  IGGMRYMVTFIDDFSRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVY
        I   RY V F+D F+RY W++ +K+K +    F  FK  VEN  + RI  L SDN GE++      YL ++ I H  + P+T E NGL+ERK+RH  ++ 
Subjt:  IGGMRYMVTFIDDFSRYAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVY

Query:  RSML-HAS
         ++L HAS
Subjt:  RSML-HAS

Arabidopsis top hits | e value | % identity
ATMG00300.1 Gag-Pol-related retrotransposon family protein | 5.8e-10 | 33.65
Query:  LMEGRRIKSIYVM--SVEI--AYVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHS
        +++G R  S+Y++  SVE   + + +T K+ET  LWH+RL H+    ++ ++ K  L       +K    C  C YGK H++ F   +   K PL+ VHS
Subjt:  LMEGRRIKSIYVM--SVEI--AYVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHS

Query:  DVFG
        D++G
Subjt:  DVFG


Sequences
CDS sequence
ATGAAAATGATTGGATTGCTGATTCAAGATGTTCTAACCATACGACAAGAGATAAGAGGAAGCCGAGTTGTTGTAATTGCAAACAACTCGAAGTTGCCAATAGCTCAGGT
TGGTAAAACTATGATAGTGCCTTGTTCTAATTCTAATCAAGTGGAATTAGATCATGTATTTTATGTTCCTGGAATGAAGAAGAATTTGATGTCAGTATCTCAATTGACTT
CAGCAGACAACTTCGTCGTATTTGGACCTGACGATGTAAAGGTGTATCATAATCTTAAAGTAAGTGGTACACCGTTGATGGAAGGACGAAGGATAAAGTCCATCTACGTT
ATGTCAGTAGAGATCGCCTACGTGAAAAAGACGCAAAAGAATGAAACAGCAGATTTGTGGCATGCAAGACTTGGTCATGTTATCTACAACAAATTAAAGACAATAATAAA
CAAGTTCATGTTGAAGGGGTTGCCACAACTTGATATCAAAGAAGACATGGTATGTGCTGGTTGCCAGTATGGGAAAGCACATCAACTACCATTTAAGGAGTCCAAATTCA
GAGTAAAACAACCATTGGAGTTGGTGCATTCAGATGTATTTGGTCCGGTCAAACAATCTTCAATCGGTGGAATGCGCTACATGGTAACCTTTATCGATGACTTCTCTAGG
TATGCTTGGGTGTTTTTTATGAAAGAGAAGTTTGAAACAATTACAAACTTTAAAGAATTCAAAGAACAAGTTGAAAATGAGTTAGAAAAGAGAATTCAATGTTTACGTTC
AGATAATAGGGGAGAATATATCTCTAATGAATTCTCTCAATACTTGAAAAAATATAAGATATATCATCAGTTAACGTGTCCAAACACTTCAGAAAAAAATGGACTGGCAG
AAAGAAAAAATAGACATCCTGCAAAAGTATATCGTAGTATGTTACATGCAAGTAATGTTTTAGGAAGATTTTAG
mRNA sequence
ATGAAAATGATTGGATTGCTGATTCAAGATGTTCTAACCATACGACAAGAGATAAGAGGAAGCCGAGTTGTTGTAATTGCAAACAACTCGAAGTTGCCAATAGCTCAGGT
TGGTAAAACTATGATAGTGCCTTGTTCTAATTCTAATCAAGTGGAATTAGATCATGTATTTTATGTTCCTGGAATGAAGAAGAATTTGATGTCAGTATCTCAATTGACTT
CAGCAGACAACTTCGTCGTATTTGGACCTGACGATGTAAAGGTGTATCATAATCTTAAAGTAAGTGGTACACCGTTGATGGAAGGACGAAGGATAAAGTCCATCTACGTT
ATGTCAGTAGAGATCGCCTACGTGAAAAAGACGCAAAAGAATGAAACAGCAGATTTGTGGCATGCAAGACTTGGTCATGTTATCTACAACAAATTAAAGACAATAATAAA
CAAGTTCATGTTGAAGGGGTTGCCACAACTTGATATCAAAGAAGACATGGTATGTGCTGGTTGCCAGTATGGGAAAGCACATCAACTACCATTTAAGGAGTCCAAATTCA
GAGTAAAACAACCATTGGAGTTGGTGCATTCAGATGTATTTGGTCCGGTCAAACAATCTTCAATCGGTGGAATGCGCTACATGGTAACCTTTATCGATGACTTCTCTAGG
TATGCTTGGGTGTTTTTTATGAAAGAGAAGTTTGAAACAATTACAAACTTTAAAGAATTCAAAGAACAAGTTGAAAATGAGTTAGAAAAGAGAATTCAATGTTTACGTTC
AGATAATAGGGGAGAATATATCTCTAATGAATTCTCTCAATACTTGAAAAAATATAAGATATATCATCAGTTAACGTGTCCAAACACTTCAGAAAAAAATGGACTGGCAG
AAAGAAAAAATAGACATCCTGCAAAAGTATATCGTAGTATGTTACATGCAAGTAATGTTTTAGGAAGATTTTAG
Protein sequence
MKMIGLLIQDVLTIRQEIRGSRVVVIANNSKLPIAQVGKTMIVPCSNSNQVELDHVFYVPGMKKNLMSVSQLTSADNFVVFGPDDVKVYHNLKVSGTPLMEGRRIKSIYV
MSVEIAYVKKTQKNETADLWHARLGHVIYNKLKTIINKFMLKGLPQLDIKEDMVCAGCQYGKAHQLPFKESKFRVKQPLELVHSDVFGPVKQSSIGGMRYMVTFIDDFSR
YAWVFFMKEKFETITNFKEFKEQVENELEKRIQCLRSDNRGEYISNEFSQYLKKYKIYHQLTCPNTSEKNGLAERKNRHPAKVYRSMLHASNVLGRF
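
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which the page does not state explicitly; `CODON_TABLE` and `translate` are hypothetical names, and the line breaks in the sequences are assumed to have been removed before the call.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence in frame 1.

    Unknown codons (for example those containing N) become 'X'.
    With to_stop=True, translation ends before the first stop codon.
    """
    seq = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(seq) - 2, 3):  # complete codons only
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First six and last four codons of the CDS listed above.
print(translate("ATGAAAATGATTGGATTG"))  # MKMIGL
print(translate("GGAAGATTTTAG"))        # GRF
```

Joining the CDS lines into a single string and comparing `translate(cds)` with the joined protein lines is a quick way to confirm that the two records agree.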