CuGenDBv2

Cmc05g0130471 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc05g0130471
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: CMiso1.1chr05:8938857..8940059
RNA-Seq Expression: Cmc05g0130471
Synteny: Cmc05g0130471
Gene Ontology terms:
  GO:0009231 - riboflavin biosynthetic process (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0071897 - DNA biosynthetic process (biological process)
  GO:0016020 - membrane (cellular component)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
  GO:0004491 - methylmalonate-semialdehyde dehydrogenase (acylating) activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
  GO:0008686 - 3,4-dihydroxy-2-butanone-4-phosphate synthase activity (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR001878 - Zinc finger, CCHC-type
  IPR012337 - Ribonuclease H-like superfamily
  IPR025724 - GAG-pre-integrase domain
  IPR036397 - Ribonuclease H superfamily
  IPR036875 - Zinc finger, CCHC-type superfamily
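
The genome location string in the record above follows a seqid:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state its convention), the string can be split and the span length computed as follows. The function name parse_location is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end, length)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
    return seqid, start, end, end - start + 1


print(parse_location("CMiso1.1chr05:8938857..8940059"))
# ('CMiso1.1chr05', 8938857, 8940059, 1203)
```

Under that assumption the span is 1203 bp.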


Homology
GenBank top hits (columns: e value, % identity, alignment)
KYP36562.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]; e value 4.0e-167; identity 71.86%
Query:  GHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHG
        G++GAGKK  KK+ KG +  LK+ ++S PI KK    + C FC K GH++K+C KRKAWFE KGK NA VCFESNLTEVP+NTWWIDSGCT HVSNTM G
Subjt:  GHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHG

Query:  FLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQNIFIGSGILCDGLYKLKLDN
        F T +T +PNE+F+FMGNRVKV VEAV TY L L+TGHHLDL +T YVPS+SRNL+SLSKLD  GY F FGNGCFSLFK+N  IG+GILCDGLYKL LD 
Subjt:  FLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQNIFIGSGILCDGLYKLKLDN

Query:  VFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSF
        ++ E+LLTLHHN+GTKR   NE SA+LWH+RL HIS+ER++RLIKNEILP+LDFTDL I VDCIKGKQTKHT  K ATRS+QLLEI+HTDIC  FDV SF
Subjt:  VFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSF

Query:  GGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE
        G EKYFITFI+D+SRYGY+YLLHEKSQA+DAL++++NEVERQLD+ VK++RSDRGGEYYG+Y+E GQ P PFAK L+  G CAQYTMPGTPQQNGV+E
Subjt:  GGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE

RYE18822.1 hypothetical protein EOP45_13565, partial [Sphingobacteriaceae bacterium]; e value 3.9e-162; identity 75.2%
Query:  MGHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMH
        +GHKGA KKP  K GKG +G  K+ +SS  IHKK Q  D CRFC K GH++K+C KRK WFE KGK +A VCFESN  EVPYNTWW+DSGCT HVSNTM 
Subjt:  MGHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMH

Query:  GFLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQNIFIGSGILCDGLYKLKLD
        GFLTT+T + NE+FI MGNR KV VEA+ TY L LDTGHHLDLF TFYVPS+SRNL+S+SKLD +GY F FGNGCFSLFKQN+F+GSGILCDGLYKLKLD
Subjt:  GFLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQNIFIGSGILCDGLYKLKLD

Query:  NVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRS
          FAE+LLT+HHNVGTKRG +NESSAYLWHKRL HISKERI+RL+KN+ILP+LDFTDLG+ V+CIKGK T+ T+ K ATRSSQLLEIIHTDIC  FDV S
Subjt:  NVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRS

Query:  FGGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENG
         GGE+YFITFI+DFSRYGY+YLLHEKSQ++D L+VF+NEVERQLDR VKI+RSDRGGEYYG+YDENG
Subjt:  FGGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENG

RZC09906.1 B2 protein isoform D [Glycine soja]; e value 1.7e-141; identity 64.82%
Query:  GHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHG
        G++GAGKK  KK+ KG +G LK+K     I KK    + C FC K GH++K+C KRK+WFE KG+ NAL                              G
Subjt:  GHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHG

Query:  FLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQNIFIGSGILCDGLYKLKLDN
        FLT +T +PN++F+FMGNRVK  VEAV TY L LDTGHHLDL +T YVPS+SRNL+SLSKLD +GY F FGNGCFSLFK N  IG+G+LCDGLYKLKLD 
Subjt:  FLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQNIFIGSGILCDGLYKLKLDN

Query:  VFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSF
        ++ E++LTLHHNVGTKR   NE SA+LWHKRL HIS+ERI+RLIKNEILPDLDFTDL I VDCIKGKQTKHT  K ATRS+QLLEI+HTDIC  FDV SF
Subjt:  VFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSF

Query:  GGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE
        G E+YFITFI+D+SRYGY+YLLHEKSQA++AL++++NEVERQLDR VKI+RSDR GEYY +YDE GQ   PFAK L+  G CAQYTMPGT QQNGV+E
Subjt:  GGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE

RZC12927.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform A [Glycine soja]; e value 3.1e-167; identity 72.36%
Query:  GHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHG
        G++GAGK   KK+ KG +G LK+K     I KK    + C FC K GH++K+C KRK+WFE KG+ NALV FESNLTEVP+NTWWIDSGCT HVSNTM G
Subjt:  GHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHG

Query:  FLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQNIFIGSGILCDGLYKLKLDN
        FLT +T +PNE+F+FMGNRVK  VEAV TY L LDTGHHLDL +T YVPS+SRNL+SLSKLD +GY F FGNGCFSLFK N  IG+G+LCDGLYKLKLD 
Subjt:  FLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQNIFIGSGILCDGLYKLKLDN

Query:  VFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSF
        ++ E++LTLHHNVGTKR   NE SA+LWHKRL HIS+ERI+RLIKNEILPDLDFTDL I VDCIKGKQTKHT  K ATRS+QLLEI+HTDIC  FDV SF
Subjt:  VFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSF

Query:  GGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE
        G E+YFITFI+D+SRYGY+YLLHEKSQA++AL++++NEVERQLDR VKI+RSDRGGEYYG+YDE GQ P PFAK L+  G CAQYTMPGTPQQNGV+E
Subjt:  GGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE

RZC25410.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja]; e value 8.0e-168; identity 72.61%
Query:  GHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHG
        G++GAGKK  KK+ KG +G LK+K     I KK    + C FC K GH++K+C KRK+WFE KG+ NALVCFESNLTEVP+NTWWIDSGCT HVSNTM G
Subjt:  GHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHG

Query:  FLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQNIFIGSGILCDGLYKLKLDN
        FLT +T +PNE+F+FMGNRVK  VEAV TY L LDTGHHLDL +T YVPS+SRNL+SLSKLD +GY F FGNGCFSLFK N  IG+G+LCDGLYKLKLD 
Subjt:  FLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQNIFIGSGILCDGLYKLKLDN

Query:  VFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSF
        ++ E++LTLHHNVGTKR   NE SA+LWHKRL HIS ERI+RLIKNEILPDLDFTDL I VDCIKGKQTKHT  K ATRS+QLLEI+HTDIC  FDV SF
Subjt:  VFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSF

Query:  GGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE
        G E+YFITFI+D+SRYGY+YLLHEKSQA++AL++++NEVERQLDR VKI+RSDRGGEYY +YDE GQ P+PFAK L+  G CAQYTMPGTPQQNGV+E
Subjt:  GGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE

TrEMBL top hits (columns: e value, % identity, alignment)
A0A151R237 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value 1.9e-167; identity 71.86%
Query:  GHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHG
        G++GAGKK  KK+ KG +  LK+ ++S PI KK    + C FC K GH++K+C KRKAWFE KGK NA VCFESNLTEVP+NTWWIDSGCT HVSNTM G
Subjt:  GHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHG

Query:  FLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQNIFIGSGILCDGLYKLKLDN
        F T +T +PNE+F+FMGNRVKV VEAV TY L L+TGHHLDL +T YVPS+SRNL+SLSKLD  GY F FGNGCFSLFK+N  IG+GILCDGLYKL LD 
Subjt:  FLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQNIFIGSGILCDGLYKLKLDN

Query:  VFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSF
        ++ E+LLTLHHN+GTKR   NE SA+LWH+RL HIS+ER++RLIKNEILP+LDFTDL I VDCIKGKQTKHT  K ATRS+QLLEI+HTDIC  FDV SF
Subjt:  VFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSF

Query:  GGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE
        G EKYFITFI+D+SRYGY+YLLHEKSQA+DAL++++NEVERQLD+ VK++RSDRGGEYYG+Y+E GQ P PFAK L+  G CAQYTMPGTPQQNGV+E
Subjt:  GGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE

A0A445KGB1 B2 protein isoform D; e value 8.2e-142; identity 64.82%
Query:  GHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHG
        G++GAGKK  KK+ KG +G LK+K     I KK    + C FC K GH++K+C KRK+WFE KG+ NAL                              G
Subjt:  GHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHG

Query:  FLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQNIFIGSGILCDGLYKLKLDN
        FLT +T +PN++F+FMGNRVK  VEAV TY L LDTGHHLDL +T YVPS+SRNL+SLSKLD +GY F FGNGCFSLFK N  IG+G+LCDGLYKLKLD 
Subjt:  FLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQNIFIGSGILCDGLYKLKLDN

Query:  VFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSF
        ++ E++LTLHHNVGTKR   NE SA+LWHKRL HIS+ERI+RLIKNEILPDLDFTDL I VDCIKGKQTKHT  K ATRS+QLLEI+HTDIC  FDV SF
Subjt:  VFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSF

Query:  GGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE
        G E+YFITFI+D+SRYGY+YLLHEKSQA++AL++++NEVERQLDR VKI+RSDR GEYY +YDE GQ   PFAK L+  G CAQYTMPGT QQNGV+E
Subjt:  GGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE

A0A445KPR8 Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform A; e value 1.5e-167; identity 72.36%
Query:  GHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHG
        G++GAGK   KK+ KG +G LK+K     I KK    + C FC K GH++K+C KRK+WFE KG+ NALV FESNLTEVP+NTWWIDSGCT HVSNTM G
Subjt:  GHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHG

Query:  FLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQNIFIGSGILCDGLYKLKLDN
        FLT +T +PNE+F+FMGNRVK  VEAV TY L LDTGHHLDL +T YVPS+SRNL+SLSKLD +GY F FGNGCFSLFK N  IG+G+LCDGLYKLKLD 
Subjt:  FLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQNIFIGSGILCDGLYKLKLDN

Query:  VFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSF
        ++ E++LTLHHNVGTKR   NE SA+LWHKRL HIS+ERI+RLIKNEILPDLDFTDL I VDCIKGKQTKHT  K ATRS+QLLEI+HTDIC  FDV SF
Subjt:  VFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSF

Query:  GGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE
        G E+YFITFI+D+SRYGY+YLLHEKSQA++AL++++NEVERQLDR VKI+RSDRGGEYYG+YDE GQ P PFAK L+  G CAQYTMPGTPQQNGV+E
Subjt:  GGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE

A0A445LQ30 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value 3.9e-168; identity 72.61%
Query:  GHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHG
        G++GAGKK  KK+ KG +G LK+K     I KK    + C FC K GH++K+C KRK+WFE KG+ NALVCFESNLTEVP+NTWWIDSGCT HVSNTM G
Subjt:  GHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHG

Query:  FLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQNIFIGSGILCDGLYKLKLDN
        FLT +T +PNE+F+FMGNRVK  VEAV TY L LDTGHHLDL +T YVPS+SRNL+SLSKLD +GY F FGNGCFSLFK N  IG+G+LCDGLYKLKLD 
Subjt:  FLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQNIFIGSGILCDGLYKLKLDN

Query:  VFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSF
        ++ E++LTLHHNVGTKR   NE SA+LWHKRL HIS ERI+RLIKNEILPDLDFTDL I VDCIKGKQTKHT  K ATRS+QLLEI+HTDIC  FDV SF
Subjt:  VFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSF

Query:  GGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE
        G E+YFITFI+D+SRYGY+YLLHEKSQA++AL++++NEVERQLDR VKI+RSDRGGEYY +YDE GQ P+PFAK L+  G CAQYTMPGTPQQNGV+E
Subjt:  GGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE

A0A4Q3EHL3 Uncharacterized protein (Fragment); e value 1.9e-162; identity 75.2%
Query:  MGHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMH
        +GHKGA KKP  K GKG +G  K+ +SS  IHKK Q  D CRFC K GH++K+C KRK WFE KGK +A VCFESN  EVPYNTWW+DSGCT HVSNTM 
Subjt:  MGHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMH

Query:  GFLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQNIFIGSGILCDGLYKLKLD
        GFLTT+T + NE+FI MGNR KV VEA+ TY L LDTGHHLDLF TFYVPS+SRNL+S+SKLD +GY F FGNGCFSLFKQN+F+GSGILCDGLYKLKLD
Subjt:  GFLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQNIFIGSGILCDGLYKLKLD

Query:  NVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRS
          FAE+LLT+HHNVGTKRG +NESSAYLWHKRL HISKERI+RL+KN+ILP+LDFTDLG+ V+CIKGK T+ T+ K ATRSSQLLEIIHTDIC  FDV S
Subjt:  NVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRS

Query:  FGGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENG
         GGE+YFITFI+DFSRYGY+YLLHEKSQ++D L+VF+NEVERQLDR VKI+RSDRGGEYYG+YDENG
Subjt:  FGGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENG

SwissProt top hits (columns: e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value 4.3e-39; identity 28.74%
Query:  KGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLK-RKAWFENKGKHN-----ALVCFESNLT------------EVPYNTW
        +G G+   + +    R   + K  +     K ++++ C  CN+PGH+K++C   RK   E  G+ N     A+V    N+               P + W
Subjt:  KGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLK-RKAWFENKGKHN-----ALVCFESNLT------------EVPYNTW

Query:  WIDSGCTIHVSNTMHGFLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQNIFI
         +D+  + H +      L  R    +   + MGN     +  +   C+  + G  L L D  +VP +  NLIS   LD  GY   F N  + L K ++ I
Subjt:  WIDSGCTIHVSNTMHGFLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQNIFI

Query:  GSGILCDGLYKLKLDNVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLL
          G+    LY+   +              G      +E S  LWHKR+ H+S++ ++ L K  ++     T +     C+ GKQ + +    + R   +L
Subjt:  GSGILCDGLYKLKLDNVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLL

Query:  EIIHTDICRSFDVRSFGGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQ
        +++++D+C   ++ S GG KYF+TFI+D SR  ++Y+L  K Q     + F   VER+  R +K LRSD GGEY  +          F ++  SHG   +
Subjt:  EIIHTDICRSFDVRSFGGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQ

Query:  YTMPGTPQQNGVAE
         T+PGTPQ NGVAE
Subjt:  YTMPGTPQQNGVAE

Q12491 Transposon Ty2-B Gag-Pol polyprotein; e value 1.3e-11; identity 23.64%
Query:  IDSGCTIHVSNTMHGFLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCF---SLFKQNI
        IDSG +  +  + H +L   T N +E  I    +  + + A+         G    +    + P+I+ +L+SLS+L            CF   +L + + 
Subjt:  IDSGCTIHVSNTMHGFLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCF---SLFKQNI

Query:  FIGSGILCDG-LYKLKLDNVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGIS-------VDCIKGKQTKHTVN
         + + I+  G  Y L    +    +  L  N   K    N+    L H+ L H +   I++ +K   +  L  +D+  S        DC+ GK TKH   
Subjt:  FIGSGILCDG-LYKLKLDNVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGIS-------VDCIKGKQTKHTVN

Query:  K----EATRSSQLLEIIHTDICRSFDVRSFGGEKYFITFIEDFSRYGYIYLLHEKSQ--AIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQC
        K    +   S +  + +HTDI             YFI+F ++ +R+ ++Y LH++ +   ++     +  ++ Q +  V +++ DRG EY  K       
Subjt:  K----EATRSSQLLEIIHTDICRSFDVRSFGGEKYFITFIEDFSRYGYIYLLHEKSQ--AIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQC

Query:  PAPFAKFLESHGRCAQYTMPGTPQQNGVAE
             KF  + G  A YT     + +GVAE
Subjt:  PAPFAKFLESHGRCAQYTMPGTPQQNGVAE

Q12501 Transposon Ty2-OR2 Gag-Pol polyprotein; e value 1.0e-11; identity 23.64%
Query:  IDSGCTIHVSNTMHGFLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCF---SLFKQNI
        IDSG +  +  + H +L   T N +E  I    +  + + A+         G    +    + P+I+ +L+SLS+L            CF   +L + + 
Subjt:  IDSGCTIHVSNTMHGFLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCF---SLFKQNI

Query:  FIGSGILCDG-LYKLKLDNVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGIS-------VDCIKGKQTKHTVN
         + + I+  G  Y L    +    +  L  N   K    N+    L H+ L H +   I++ +K   +  L  +D+  S        DC+ GK TKH   
Subjt:  FIGSGILCDG-LYKLKLDNVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGIS-------VDCIKGKQTKHTVN

Query:  K----EATRSSQLLEIIHTDICRSFDVRSFGGEKYFITFIEDFSRYGYIYLLHEKSQ--AIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQC
        K    +   S +  + +HTDI             YFI+F ++ +R+ ++Y LH++ +   ++     +  ++ Q +  V +++ DRG EY  K       
Subjt:  K----EATRSSQLLEIIHTDICRSFDVRSFGGEKYFITFIEDFSRYGYIYLLHEKSQ--AIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQC

Query:  PAPFAKFLESHGRCAQYTMPGTPQQNGVAE
             KF  + G  A YT     + +GVAE
Subjt:  PAPFAKFLESHGRCAQYTMPGTPQQNGVAE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value 6.9e-21; identity 25%
Query:  GKKPGKKNGKGNRGHLKV-KQSSAPIH-KKGQIK---DKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCF-----ESNLT-EVPY--NTWWIDSGCTI
        G +  + + + N  + K  +QSS   H    Q K    KC+ C   GH  K C + + +  +         F      +NL    PY  N W +DSG T 
Subjt:  GKKPGKKNGKGNRGHLKV-KQSSAPIH-KKGQIK---DKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCF-----ESNLT-EVPY--NTWWIDSGCTI

Query:  HVSNTMHGF-LTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDT-GHHLDLFDTFYVPSISRNLISLSKL-DTSGYYFKFGNGCFSLFKQN--IFIGSG
        H+++  +   L    T  ++  +  G+ + +S     T   +L T    L+L +  YVP+I +NLIS+ +L + +G   +F    F +   N  + +  G
Subjt:  HVSNTMHGF-LTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDT-GHHLDLFDTFYVPSISRNLISLSKL-DTSGYYFKFGNGCFSLFKQN--IFIGSG

Query:  ILCDGLYKLKLDNVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISV-DCIKGKQTKHTVNKEATRSSQLLEI
           D LY+  + +    SL             +++++   WH RL H +   +  +I N  L  L+ +   +S  DC+  K  K   ++    S++ LE 
Subjt:  ILCDGLYKLKLDNVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISV-DCIKGKQTKHTVNKEATRSSQLLEI

Query:  IHTDICRSFDVRSFGGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQYT
        I++D+  S  + S    +Y++ F++ F+RY ++Y L +KSQ  +    F N +E +    +    SD GGE+   ++           +   HG     +
Subjt:  IHTDICRSFDVRSFGGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQYT

Query:  MPGTPQQNGVAE
         P TP+ NG++E
Subjt:  MPGTPQQNGVAE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value 1.1e-21; identity 25.3%
Query:  HKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCF-----ESNL-TEVPY--NTWWIDSGCTIH
        ++G  +     N + N        S +   +      +C+ C+  GH  K C +   +     +  +   F      +NL    PY  N W +DSG T H
Subjt:  HKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCF-----ESNL-TEVPY--NTWWIDSGCTIH

Query:  VSNTMHGFLTTRT-TNPNERFIFMGNRVKVSVEAVRTYCLTLDT-GHHLDLFDTFYVPSISRNLISLSKL-DTSGYYFKFGNGCFSLFKQN--IFIGSGI
        +++  +     +  T  ++  I  G+ + ++     T   +L T    LDL    YVP+I +NLIS+ +L +T+    +F    F +   N  + +  G 
Subjt:  VSNTMHGFLTTRT-TNPNERFIFMGNRVKVSVEAVRTYCLTLDT-GHHLDLFDTFYVPSISRNLISLSKL-DTSGYYFKFGNGCFSLFKQN--IFIGSGI

Query:  LCDGLYKLKLDNVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISV-DCIKGKQTKHTVNKEATRSSQLLEII
          D LY+  + +  A S+              ++++   WH RL H S   +  +I N  LP L+ +   +S  DC   K  K   +     SS+ LE I
Subjt:  LCDGLYKLKLDNVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISV-DCIKGKQTKHTVNKEATRSSQLLEII

Query:  HTDICRSFDVRSFGGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQYTM
        ++D+  S  + S    +Y++ F++ F+RY ++Y L +KSQ  D   +F + VE +    +  L SD GGE+    D           +L  HG     + 
Subjt:  HTDICRSFDVRSFGGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQYTM

Query:  PGTPQQNGVAE
        P TP+ NG++E
Subjt:  PGTPQQNGVAE

Arabidopsis top hits (columns: e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGGTCATAAAGGAGCTGGAAAGAAACCTGGAAAAAAGAATGGCAAGGGCAATCGTGGACATTTAAAGGTAAAACAGTCATCTGCCCCAATCCACAAAAAGGGA
CAAATTAAGGATAAGTGTCGTTTTTGCAACAAACCTGGACACTATAAAAAAAATTGTCTAAAACGTAAGGCATGGTTCGAGAATAAAGGTAAGCATAATGCTTTA
GTATGTTTCGAATCAAACTTAACTGAAGTTCCTTATAATACATGGTGGATTGATTCTGGTTGTACGATTCATGTTTCCAATACGATGCATGGATTCCTTACGACC
CGAACCACAAACCCAAATGAGAGATTCATTTTTATGGGAAACAGAGTCAAAGTTTCAGTTGAAGCTGTGAGAACCTATTGTTTAACTTTAGATACTGGACATCAT
TTAGACCTTTTTGATACCTTTTATGTTCCTTCTATTTCTCGTAATTTGATTTCCTTGTCAAAACTTGATACTTCAGGTTATTACTTTAAATTTGGGAATGGATGT
TTTAGTTTATTCAAACAGAACATTTTTATTGGTTCCGGTATTCTTTGTGATGGCTTATATAAATTAAAGCTTGATAATGTTTTTGCTGAGAGTTTGTTAACCTTG
CATCATAATGTTGGTACTAAACGTGGTCAAACTAATGAATCGTCAGCTTACTTGTGGCATAAACGTTTAGATCACATATCCAAAGAAAGAATTAAAAGATTGATA
AAGAATGAAATTCTTCCAGATTTGGATTTTACTGACCTTGGAATTAGTGTGGATTGTATTAAAGGAAAACAAACAAAACACACAGTTAATAAAGAAGCCACAAGA
AGCTCACAACTCCTTGAAATTATACACACTGATATTTGTAGGTCTTTTGATGTTCGATCTTTTGGTGGAGAAAAGTATTTTATCACCTTTATTGAGGATTTCTCA
CGTTATGGTTATATCTATTTATTGCATGAGAAATCTCAAGCAATAGATGCCTTAAAAGTATTTATAAATGAAGTTGAAAGGCAATTAGATAGAAATGTGAAAATC
TTAAGATCTGATAGAGGTGGTGAGTATTATGGAAAATATGACGAGAATGGACAATGCCCCGCTCCATTCGCTAAATTCCTAGAAAGCCATGGCAGATGTGCTCAA
TACACAATGCCAGGAACACCACAACAAAATGGTGTTGCAGAAATGTGA
mRNA sequence
ATGGGTCATAAAGGAGCTGGAAAGAAACCTGGAAAAAAGAATGGCAAGGGCAATCGTGGACATTTAAAGGTAAAACAGTCATCTGCCCCAATCCACAAAAAGGGA
CAAATTAAGGATAAGTGTCGTTTTTGCAACAAACCTGGACACTATAAAAAAAATTGTCTAAAACGTAAGGCATGGTTCGAGAATAAAGGTAAGCATAATGCTTTA
GTATGTTTCGAATCAAACTTAACTGAAGTTCCTTATAATACATGGTGGATTGATTCTGGTTGTACGATTCATGTTTCCAATACGATGCATGGATTCCTTACGACC
CGAACCACAAACCCAAATGAGAGATTCATTTTTATGGGAAACAGAGTCAAAGTTTCAGTTGAAGCTGTGAGAACCTATTGTTTAACTTTAGATACTGGACATCAT
TTAGACCTTTTTGATACCTTTTATGTTCCTTCTATTTCTCGTAATTTGATTTCCTTGTCAAAACTTGATACTTCAGGTTATTACTTTAAATTTGGGAATGGATGT
TTTAGTTTATTCAAACAGAACATTTTTATTGGTTCCGGTATTCTTTGTGATGGCTTATATAAATTAAAGCTTGATAATGTTTTTGCTGAGAGTTTGTTAACCTTG
CATCATAATGTTGGTACTAAACGTGGTCAAACTAATGAATCGTCAGCTTACTTGTGGCATAAACGTTTAGATCACATATCCAAAGAAAGAATTAAAAGATTGATA
AAGAATGAAATTCTTCCAGATTTGGATTTTACTGACCTTGGAATTAGTGTGGATTGTATTAAAGGAAAACAAACAAAACACACAGTTAATAAAGAAGCCACAAGA
AGCTCACAACTCCTTGAAATTATACACACTGATATTTGTAGGTCTTTTGATGTTCGATCTTTTGGTGGAGAAAAGTATTTTATCACCTTTATTGAGGATTTCTCA
CGTTATGGTTATATCTATTTATTGCATGAGAAATCTCAAGCAATAGATGCCTTAAAAGTATTTATAAATGAAGTTGAAAGGCAATTAGATAGAAATGTGAAAATC
TTAAGATCTGATAGAGGTGGTGAGTATTATGGAAAATATGACGAGAATGGACAATGCCCCGCTCCATTCGCTAAATTCCTAGAAAGCCATGGCAGATGTGCTCAA
TACACAATGCCAGGAACACCACAACAAAATGGTGTTGCAGAAATGTGA
Protein sequence
MGHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHGFLTT
RTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQNIFIGSGILCDGLYKLKLDNVFAESLLTL
HHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSFGGEKYFITFIEDFS
RYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAEM
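
The protein sequence listed above should correspond to a codon-by-codon translation of the CDS sequence. The sketch below shows one way to check that in plain Python, assuming the standard genetic code applies to this nuclear gene (an assumption; the page does not name a translation table). The names CODON_TABLE and translate are hypothetical helpers, not part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS string codon by codon, dropping one trailing stop."""
    cds = "".join(cds.split()).upper()  # remove line breaks from wrapped input
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First 24 and last 18 nucleotides of the CDS listed on this page.
print(translate("ATGGGTCATAAAGGAGCTGGAAAG"))  # MGHKGAGK
print(translate("GGTGTTGCAGAAATGTGA"))        # GVAEM
```

The two fragments translate to the first eight and last five residues of the protein sequence above. Passing the full wrapped CDS works the same way, because whitespace is removed before translation. Ambiguous bases such as N are not handled and would raise a KeyError.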