; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh09G008880 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh09G008880
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationCmo_Chr09:4615294..4619610
RNA-Seq ExpressionCmoCh09G008880
SyntenyCmoCh09G008880
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0015074 - DNA integration (biological process)
GO:0005840 - ribosome (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
GO:0008097 - 5S rRNA binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW33283.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]4.3e-16083.94Show/hide
Query:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP
        MASESS HLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQD+LGYVDGT VPPPRFEPETS+TL+ KYLAW+AADQRLLCLLLS LTEEA+AVVVGL 
Subjt:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP

Query:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE
        TAR+VWLALE TF+H SKARELRLKDDLQLMKRGTKPVAEYAR FK +C+QLHAIGRPVED DKVHWFLRGLGT+FS+FSTAQM+LT +P FADLVSKAE
Subjt:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE

Query:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN
        SFELFQ SLESS  T  AFT TN  RT  HG+  A   NQRGRS+SH NNSSNRGRT+S  GRRPP CQICR EGHYADRCNQRY R DSS AHLAEAFN
Subjt:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN

Query:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHT
        TSCS++GP+ ADWFLDTGASAHMT DPSILDQSKNY GKDSVIVGNGASLPITHT
Subjt:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHT

RVW43615.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.1e-15883.38Show/hide
Query:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP
        MASESS HLLPFNTLIHMI IKLSSSNYLLWKSQLLPLLESQD+L YVDGT VPPPRFEPETS+TL+ KYLAW+AADQRLLCLLLSSLTEEA+AVVVGL 
Subjt:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP

Query:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE
        TAR+VWLALE TFSH SKARELRLKDDLQLMKRGTKPVAEYAR FK +CDQLHAIGRPVED DKVHWFL GLG +FS+FST QM+LT +P FADLVSKAE
Subjt:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE

Query:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN
        SFELFQ SLESS  T  AFT TNR RT  HG+  A   NQRGRS+SH NNSSNRGRT+S  GRRPP CQICR EGHYADRCNQRY R DSS AHLAEAFN
Subjt:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN

Query:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHT
        TSCS++GP+ ADWFLDTGASAHMT DPS LDQSKNY GKDSVIVGNGASLPITHT
Subjt:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHT

RVW45095.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]6.6e-16184.23Show/hide
Query:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP
        MASESS HLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQD+LGYVDGT VPPPRFEPETS+TL+ KYLAW+AADQRLLCLLLSSLTEEA+ VVVGL 
Subjt:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP

Query:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE
        TAR+VWLALE TFSH SKARELRLKDDLQLMKRGTKPVAEYAR FK +CDQLHAIGRPVED DKVHWFLRGLGT+FS+FSTAQM+LT +P FADLVSKAE
Subjt:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE

Query:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN
        SFELFQ SLESS  T  AFT TNR  T  HG+  A   NQRGRS+SH NNSSNRGRT+S  GRRPP CQICR EGHYADRCNQRY R DSS AHLAEAFN
Subjt:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN

Query:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHT
        TSCS++GP+ ADWFLDTGASAHMT DPSILDQSKNY GKD VIVGNGASLPITHT
Subjt:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHT

RVW62468.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]4.3e-16083.66Show/hide
Query:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP
        MASESS HLLPFNTLIHMITIKLSSSNYLLWKSQLLP LESQD+LGYVDGT VPPPRFEPETS+TL+ KYLAW+AADQRLLCLLLSSLTEEA+AVVVGL 
Subjt:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP

Query:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE
        TAR+VWLALE TFSH SKARELRLKDDLQLMKRGTKPVAEYAR FK +CDQ HAIGRPVED DKVHWFLRGL T+FS+FSTAQM+LT +P FADLVSKAE
Subjt:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE

Query:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN
        SFELFQ SLESS  T  AFT TNR RT  HG+  A   NQRGRS+SH NNSSNRGRT+S  GRRPP CQICR EGHYADRCNQRY R +SS AHLAEAFN
Subjt:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN

Query:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHT
        TSCS++GP+ ADWFLDTGASAHMT DPSILDQSKNY GKDS+IVGNGASLPITHT
Subjt:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHT

XP_019081420.1 PREDICTED: uncharacterized protein LOC109124128 isoform X1 [Vitis vinifera]8.4e-16483.61Show/hide
Query:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP
        MASESS HLLPFNTLIHMITIKLSSSNYLLWKSQLLP LESQD+L YVDGT VPPPRFEPETS+TL+ KYLAW+AA+QRLLCLLLSSLTEEA+AVVVGL 
Subjt:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP

Query:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE
        TAR+VWLALE TFSH SKARELRLKDDLQLMKRGTKPVAEYAR FK +CDQLHAIGRPVED DKVHWFLRGLG +FS+FSTAQM+LT +P FADLVSKAE
Subjt:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE

Query:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN
        SFELFQ SLESS  T  AFT TNR RT  HG+  A   NQRGRS+SH NNSSNRGRT+S  GRRPP CQIC  EGHYADRCNQRY R DSS AHLAEAFN
Subjt:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN

Query:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTESSNRKGGGNR
        TSCS++GP+ ADWFLDT ASAHMT DPSILDQSKNY GKDSVIVGNGASLPITHTESSNRKGGG+R
Subjt:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTESSNRKGGGNR

TrEMBL top hitse value%identityAlignment
A0A438C7N7 Retrovirus-related Pol polyprotein from transposon RE11.5e-15883.38Show/hide
Query:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP
        MASESS HLLPFNTLIHMITI+LSSSNYLLWKSQLLPLLESQD+LGYVDGT VP PRFEP TS+ L+ KYLAW+AADQRLLCLLLSSLTEEA+A VVGL 
Subjt:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP

Query:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE
        TAR+VWLALE TFSH SKARELRLKDDLQLMKR TKPVAE+AR FK +CDQLHAIGRPVED DKVHWFLRGLGT+FS+FSTAQMALT +P FADLVSKAE
Subjt:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE

Query:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN
        SFELFQ SLESS  T TAFT TNR RT  HG+  A  +NQRGRS+SH NNSSNRGRT+S  GRRPP CQICR EGHYADRCNQ+Y R DSS AHLAEAFN
Subjt:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN

Query:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHT
        TSCS++GP+ ADWFLDTGASAHMT DPSILDQSKNY GKDSVIVGNGASLPITHT
Subjt:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHT

A0A438DCU0 Retrovirus-related Pol polyprotein from transposon RE12.1e-16083.94Show/hide
Query:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP
        MASESS HLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQD+LGYVDGT VPPPRFEPETS+TL+ KYLAW+AADQRLLCLLLS LTEEA+AVVVGL 
Subjt:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP

Query:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE
        TAR+VWLALE TF+H SKARELRLKDDLQLMKRGTKPVAEYAR FK +C+QLHAIGRPVED DKVHWFLRGLGT+FS+FSTAQM+LT +P FADLVSKAE
Subjt:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE

Query:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN
        SFELFQ SLESS  T  AFT TN  RT  HG+  A   NQRGRS+SH NNSSNRGRT+S  GRRPP CQICR EGHYADRCNQRY R DSS AHLAEAFN
Subjt:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN

Query:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHT
        TSCS++GP+ ADWFLDTGASAHMT DPSILDQSKNY GKDSVIVGNGASLPITHT
Subjt:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHT

A0A438E763 Retrovirus-related Pol polyprotein from transposon RE15.1e-15983.38Show/hide
Query:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP
        MASESS HLLPFNTLIHMI IKLSSSNYLLWKSQLLPLLESQD+L YVDGT VPPPRFEPETS+TL+ KYLAW+AADQRLLCLLLSSLTEEA+AVVVGL 
Subjt:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP

Query:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE
        TAR+VWLALE TFSH SKARELRLKDDLQLMKRGTKPVAEYAR FK +CDQLHAIGRPVED DKVHWFL GLG +FS+FST QM+LT +P FADLVSKAE
Subjt:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE

Query:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN
        SFELFQ SLESS  T  AFT TNR RT  HG+  A   NQRGRS+SH NNSSNRGRT+S  GRRPP CQICR EGHYADRCNQRY R DSS AHLAEAFN
Subjt:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN

Query:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHT
        TSCS++GP+ ADWFLDTGASAHMT DPS LDQSKNY GKDSVIVGNGASLPITHT
Subjt:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHT

A0A438EBA0 Retrovirus-related Pol polyprotein from transposon TNT 1-943.2e-16184.23Show/hide
Query:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP
        MASESS HLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQD+LGYVDGT VPPPRFEPETS+TL+ KYLAW+AADQRLLCLLLSSLTEEA+ VVVGL 
Subjt:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP

Query:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE
        TAR+VWLALE TFSH SKARELRLKDDLQLMKRGTKPVAEYAR FK +CDQLHAIGRPVED DKVHWFLRGLGT+FS+FSTAQM+LT +P FADLVSKAE
Subjt:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE

Query:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN
        SFELFQ SLESS  T  AFT TNR  T  HG+  A   NQRGRS+SH NNSSNRGRT+S  GRRPP CQICR EGHYADRCNQRY R DSS AHLAEAFN
Subjt:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN

Query:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHT
        TSCS++GP+ ADWFLDTGASAHMT DPSILDQSKNY GKD VIVGNGASLPITHT
Subjt:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHT

A0A438FR86 Retrovirus-related Pol polyprotein from transposon RE12.1e-16083.66Show/hide
Query:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP
        MASESS HLLPFNTLIHMITIKLSSSNYLLWKSQLLP LESQD+LGYVDGT VPPPRFEPETS+TL+ KYLAW+AADQRLLCLLLSSLTEEA+AVVVGL 
Subjt:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP

Query:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE
        TAR+VWLALE TFSH SKARELRLKDDLQLMKRGTKPVAEYAR FK +CDQ HAIGRPVED DKVHWFLRGL T+FS+FSTAQM+LT +P FADLVSKAE
Subjt:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE

Query:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN
        SFELFQ SLESS  T  AFT TNR RT  HG+  A   NQRGRS+SH NNSSNRGRT+S  GRRPP CQICR EGHYADRCNQRY R +SS AHLAEAFN
Subjt:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN

Query:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHT
        TSCS++GP+ ADWFLDTGASAHMT DPSILDQSKNY GKDS+IVGNGASLPITHT
Subjt:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHT

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.7e-2926.48Show/hide
Query:  KLSSSNYLLWKSQLLPLLESQDMLGYVDG-TRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLPTARDVWLALETTFSHQSKAR
        KL+S+NYL+W  Q+  L +  ++ G++DG T +PP     + +  +NP Y  W+  D+ +   +L +++      V    TA  +W  L   +++ S   
Subjt:  KLSSSNYLLWKSQLLPLLESQDMLGYVDG-TRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLPTARDVWLALETTFSHQSKAR

Query:  ELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAESFELFQLSLESSNSTP-TAF
          +L+  L+   +GTK + +Y +      DQL  +G+P++  ++V   L  L  E+        A  + P   ++  +  + E   L++ S+   P TA 
Subjt:  ELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAESFELFQLSLESSNSTP-TAF

Query:  TVTNRGRTHGSHPASFTNQRGRSYSHKNNSSN-------RGRTHSSQGRRPPH---CQICRKEGHYADRCNQRYVRPDSSHAHLAEAFNT------SCSI
         V++R  T  ++  +    R   Y ++NN++N           H +  +  P+   CQIC  +GH A RC+Q      S ++    +  T      + ++
Subjt:  TVTNRGRTHGSHPASFTNQRGRSYSHKNNSSN-------RGRTHSSQGRRPPH---CQICRKEGHYADRCNQRYVRPDSSHAHLAEAFNT------SCSI

Query:  AGP-DTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTESSN
          P  + +W LD+GA+ H+T+D + L   + YTG D V+V +G+++PI+HT S++
Subjt:  AGP-DTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTESSN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-2527.61Show/hide
Query:  KLSSSNYLLWKSQLLPLLESQDMLGYVDG-TRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLPTARDVWLALETTFSHQSKAR
        KL+S+NYL+W  Q+  L +  ++ G++DG T +PP     +    +NP Y  WR  D+ +   +L +++      V    TA  +W  L   +++ S   
Subjt:  KLSSSNYLLWKSQLLPLLESQDMLGYVDG-TRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLPTARDVWLALETTFSHQSKAR

Query:  ELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAESFELFQLSLESSNSTP-TAF
          +L+                   F    DQL  +G+P++  ++V   L  L  ++        A  + P   ++  +  + E   L+L S+   P TA 
Subjt:  ELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAESFELFQLSLESSNSTP-TAF

Query:  TVTNRGRTHGSHPASFTNQRG--RSYSHKNNSSNRGRTHSSQGR---RPP-----HCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFNT------SCSI
         VT+R      +     N RG  R+Y++ NN SN  +  SS  R   R P      CQIC  +GH A RC Q +    +++   + +  T      + ++
Subjt:  TVTNRGRTHGSHPASFTNQRG--RSYSHKNNSSNRGRTHSSQGR---RPP-----HCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFNT------SCSI

Query:  AGPDTA-DWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTESSN
          P  A +W LD+GA+ H+T+D + L   + YTG D V++ +G+++PITHT S++
Subjt:  AGPDTA-DWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTESSN

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.9e-1227.6Show/hide
Query:  ITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLT-EEAMAVVVGLPTARDVWLALETTFSHQS
        + + +  SNY  W+   L    S D++G++DGT +P            N   + W+  D  +   L  +LT ++     V   T+RD+WL ++  F +  
Subjt:  ITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLT-EEAMAVVVGLPTARDVWLALETTFSHQS

Query:  KARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSK-AESFELFQLSLE------
         AR LRL  +L+    G   VA+Y R  KK+ D L  +  PV D + V + L GL  +F             P F D  +   E  +  + +++      
Subjt:  KARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSK-AESFELFQLSLE------

Query:  --SSNSTPTAFT----VTNRGRTHGSHPASFTNQRGRSYSHKNNSSNRGR
          SS+ST  A +    VTN  R+ G       NQ G     + N+  RGR
Subjt:  --SSNSTPTAFT----VTNRGRTHGSHPASFTNQRGRSYSHKNNSSNRGR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCGAATCTTCTTATCATCTTCTTCCTTTCAATACTCTAATCCATATGATCACCATCAAACTTTCTTCCTCCAATTATCTTCTTTGGAAAAGCCAACTTCTCCC
TCTTCTTGAGAGTCAAGACATGCTGGGCTATGTCGATGGAACCAGGGTTCCACCACCTCGCTTTGAACCAGAAACCTCTTCAACACTCAACCCCAAATATTTGGCATGGA
GAGCAGCCGATCAACGACTTCTCTGTCTCCTGCTCTCCTCTCTCACTGAGGAAGCCATGGCTGTTGTCGTTGGTCTCCCTACTGCACGTGATGTTTGGCTTGCGTTGGAA
ACTACGTTCAGCCATCAGTCGAAAGCTCGTGAACTGAGACTCAAGGATGACTTGCAGTTGATGAAACGTGGCACCAAACCTGTTGCTGAGTATGCCCGTGCCTTCAAAAA
AATTTGTGACCAACTTCATGCCATTGGCAGACCGGTCGAGGACATTGATAAAGTGCACTGGTTCCTTCGTGGACTCGGCACCGAATTTTCAGCTTTTTCTACTGCTCAGA
TGGCTCTTACCTCTATCCCCTGTTTTGCAGATTTAGTCTCTAAAGCTGAAAGTTTTGAGTTGTTTCAGCTCTCCCTTGAGTCCTCTAACTCCACTCCTACAGCATTCACA
GTCACTAATCGTGGTCGCACCCATGGAAGCCACCCTGCTTCCTTTACCAACCAGCGAGGTCGTTCTTATTCTCACAAAAATAACTCTTCTAATCGAGGACGAACCCACTC
AAGTCAAGGTCGTCGACCACCTCACTGCCAAATATGCCGCAAAGAGGGCCATTATGCTGACCGCTGCAACCAACGGTATGTTCGACCTGATTCTTCTCATGCTCACCTTG
CTGAAGCCTTTAACACGTCATGTTCTATTGCTGGACCCGATACTGCTGATTGGTTTCTGGACACTGGAGCTTCGGCCCATATGACTGCCGACCCATCTATTTTGGATCAG
TCTAAAAATTACACGGGTAAGGACTCTGTGATTGTAGGAAACGGTGCTTCCCTACCCATTACCCACACCGAATCGTCAAACAGGAAGGGTGGTGGCAACCGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCGAATCTTCTTATCATCTTCTTCCTTTCAATACTCTAATCCATATGATCACCATCAAACTTTCTTCCTCCAATTATCTTCTTTGGAAAAGCCAACTTCTCCC
TCTTCTTGAGAGTCAAGACATGCTGGGCTATGTCGATGGAACCAGGGTTCCACCACCTCGCTTTGAACCAGAAACCTCTTCAACACTCAACCCCAAATATTTGGCATGGA
GAGCAGCCGATCAACGACTTCTCTGTCTCCTGCTCTCCTCTCTCACTGAGGAAGCCATGGCTGTTGTCGTTGGTCTCCCTACTGCACGTGATGTTTGGCTTGCGTTGGAA
ACTACGTTCAGCCATCAGTCGAAAGCTCGTGAACTGAGACTCAAGGATGACTTGCAGTTGATGAAACGTGGCACCAAACCTGTTGCTGAGTATGCCCGTGCCTTCAAAAA
AATTTGTGACCAACTTCATGCCATTGGCAGACCGGTCGAGGACATTGATAAAGTGCACTGGTTCCTTCGTGGACTCGGCACCGAATTTTCAGCTTTTTCTACTGCTCAGA
TGGCTCTTACCTCTATCCCCTGTTTTGCAGATTTAGTCTCTAAAGCTGAAAGTTTTGAGTTGTTTCAGCTCTCCCTTGAGTCCTCTAACTCCACTCCTACAGCATTCACA
GTCACTAATCGTGGTCGCACCCATGGAAGCCACCCTGCTTCCTTTACCAACCAGCGAGGTCGTTCTTATTCTCACAAAAATAACTCTTCTAATCGAGGACGAACCCACTC
AAGTCAAGGTCGTCGACCACCTCACTGCCAAATATGCCGCAAAGAGGGCCATTATGCTGACCGCTGCAACCAACGGTATGTTCGACCTGATTCTTCTCATGCTCACCTTG
CTGAAGCCTTTAACACGTCATGTTCTATTGCTGGACCCGATACTGCTGATTGGTTTCTGGACACTGGAGCTTCGGCCCATATGACTGCCGACCCATCTATTTTGGATCAG
TCTAAAAATTACACGGGTAAGGACTCTGTGATTGTAGGAAACGGTGCTTCCCTACCCATTACCCACACCGAATCGTCAAACAGGAAGGGTGGTGGCAACCGGTAAAAGAG
ATGGAGGGCTATATGTGCTGGAGCGCGGCAACTCTGCTTTTATTTCAGCCCTTAGAAACAAATCTTTACGTGCTTCATATGATTTATGGCATGCTCGTCTAGGTCATGTG
AATCATTCTGTTATTTCTTTTTTAAATAGAAAAGGTCATCTTTCTCTTACGTCTTTATTGCCTTCTCCATCATTATGTAATACCTGTCAGCTTGCAAAAAGTCATCGATT
GCCTTATTCCCGCAATGAACGTAGGTCGTCTCATGTGTTAGATCTTATTCATTGTGATCTTTGGGGTCCTTCTCCCGTCAAATCAAATTCGGGTTTCCTTTATTATGTTA
TTTTTATTGATGATTATTCTCGATTCACTTGGTTTTACCCTTTAAAATTTAAATCTGATTTTTTTGATATTTTTCTTCAATTTCAAAAATTTGTGGAAAATCAATATTCT
TCTCGTATCAAGGTATTTCAAAGTGATGGTGGTACCGAATTTACTAATACTTGTTTCAAAACTCATTTACGTAATTCTGGCATCCACCATCAACTCTCTTGTCCATATAC
ACCTGCTCAAAATGGTCGTGCTGAGAGAAAACATCGTCATGTGACTGAGACTGGCTTGGCCCTTCTCTTTCACTCTCATCTTTCTTCTCGTTTTTGGGTTGACGCCTTCA
GCACTGCAGCTTATATTATCAACCGGTTGCCTACTCCACTTCTTGGAGGTAAGTCACCCTTTGAACTCCTTTATGGCTACACTCCACATTATGACAATTTTCATCCCTTT
GGTTGTCGTGTTTATCCTTATTTGCGTGATTATATGCCTAACAAGCTTTCTCCCCGCAGCATTCCTTGTATTTTTTTGGGTTATAGTCCTGCTCATAAAGGGTTTCGCTG
TCTTGATCCGGCCACCACTAAGCTATATATCACCTGTCATGCTCAATTTGATGAAACCCACTTTCCTGCTATCCCTAGCTCCCAGACCCAACCTCTTTCCTCTATTCCTA
TTTCAAATTTCTTAGAACCACATTTTCATCATATTGATTCATCCCCCCCTACCACTTCATCACCGCAAACTCCTCGATCCAGTTCATCCCCGTGTGATATTTGTTCTGAC
CTTGTAGATGAGTCTGTGCAGGTTGATACTTCTCTTGCAGGTTCCACTTTGCCACCTTCGACTTCTAATTCGACCTCTATTGAACCTCCTGTTGATTTCTCTTCTTTGGG
CACTCATCCTATGATCACACGAGCCAAAGCTGGTATATTCAAGACTCGTCATCCAGCAAATCTTGGTATTTTGGGCTCATCTGGACTTCTTTCTGCTCTTCTTGCATCCA
CTGAGCCAAAAGGATTCAAATCTGCGGCTAAGAATCCTGCTTGGGTGGCTGCCATGGATGAAGAAATTCAAGCATTACAACAAAATGATACTTGGACTTTGGTTCCTCGC
CCTGCTAACACCAACATCGTGGGCTCTAAATGGGTGTTTCGTATTAAATATTTGCCTGATGGATCTGTCGAGCGTTTCAAGGCTCGTCTTGTTGCCAAAGGTTATACTCA
GGTTCCTGGTCTTGACTACACTGACACTTTCAGTCCGGTTGTCAAAGCTACCACTGTCCGTGTTGTGCTTTCTATTGCAGTCACAAATAAATGGCCTCTTCGACAACTTG
ATGTCAAGAATGCTTTTCTCAATGGAACTCTTATTGAACGTGTTCATATGGAACAACCTCCTGGGTATATTGATCCTCGATTTCCCACTCATGTTTGTCTATTAAAGAAA
GCCCTCTATGGTTTAAAGCAAGCTCCTCGTGCTTGGTTTCAGCGTTTTAGCTCATTTCTTCTCACACTTGGGTTTTCTTGCAGTCGCGCTGACACGTCCCTTTTTGTCTT
TCATCAGCAATCTAACATTATCTATTTGCTTCTTTATGTTGATGACATTATTGTTACCGGCAACAACTCATCTCTTATTGACAGCTTTACTCGCAAGCTTCATTCTGAGT
TTGCTACCAAAGATTTGGGTTCTCTCAGTTACTTTCTTGGTCTTGAAGCTTCACCCACTCCTGATGGTCTCTTTATTAGTCAGTTAAAATATGCTCGAGATATTCTTACT
CGTGCTCAGTTGCTCGATAGCAAACCAGTTCACACTCCTATGGTTGTTTCTCAACACTTGACTGCTGATGGTTCTCCTTTCTCTGATCCTACTCTCTACAGATCTCTTGT
TGGCGCCCTTCAGTACTTGACTATTACGCGTCCAGATATTGCCTATGCTGTCAATTCTGTCAGTCAATTCTTGCATGCCCCTACTGCAGATCACTTTCTTGCTGTCAAAC
GTATTCTTCGCTATGTCAAAGGAACACTCCACTTTGGTCTTACCTTTCGTCCATCCAATGTTCCTAGTACGCTAGTCGCTTATTCGGATGCTGACTGGGCTGGTTGTCCC
GATACTCGTCGTTCTACATCCGGCTATTCTATTTATCTTGGTAACAATCTGGTTTCTTGGAGTGCCAAAAAGCAACCTACTGTCTCACGCTCCAGCTGTGAATCTGAGTA
TCGTGCTCTTGCCACAACCGCTGCTGAACTTCTTTGGGTTACGCATCTTTTGCATGACCTCAAGGTCCCTATTTCAAAGCAGCCCTTACTCTTATGTGACAACAAAAGTG
CTATTTTTTTGAGCTCTAATCCCGTTTCTCACAAGCGGGCCAAACATGTTGAACTAGATTATCATTTCCTTCGAGAACTTGTTATCGCTGGCAAACTTCGCACACAATAT
GTACCCTCTCATCTCCAAGTTGCTGACATCTTCACAAAGAGTATTTCTCGACCTCTCTTTGAATTTTTCAGATCCAAGCTTTACGTTCGTTCAAATCCGACGCTCAGCTT
GCGGGGGGGTGTTAAGGATAGTTGACCGTCACTATAAGGCAAATATCTAGATATTTGCCTTACATTACGGGTTACCATATTTGCCCTCTTATTAAAGGCAAATATATTGC
ATTCATTTATTATTCTTTCCTTTTATTACTATTTCCATAATATTTGTAATTTACCATATTTCTGTATTCTGAAAATAATTATAAATAAGAGAACTCTCACCCCC
Protein sequenceShow/hide protein sequence
MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLPTARDVWLALE
TTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAESFELFQLSLESSNSTPTAFT
VTNRGRTHGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFNTSCSIAGPDTADWFLDTGASAHMTADPSILDQ
SKNYTGKDSVIVGNGASLPITHTESSNRKGGGNR