CuGenDBv2

Cmc03g0073151 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc03g0073151
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: CMiso1.1chr03:20561682..20562767
RNA-Seq Expression: Cmc03g0073151
Synteny: Cmc03g0073151
Gene Ontology terms:
GO:0009231 - riboflavin biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004491 - methylmalonate-semialdehyde dehydrogenase (acylating) activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0008686 - 3,4-dihydroxy-2-butanone-4-phosphate synthase activity (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
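
The genome location string in the record above can be split into a sequence ID and a coordinate pair. The following is a minimal sketch, assuming the `seqid:start..end` coordinates are 1-based and inclusive (a common convention, but not stated on this page); `Locus` and `parse_locus` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive span: both endpoints count as bases.
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Parse a 'seqid:start..end' string into a Locus."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized locus string: {text!r}")
    seqid, start, end = match.groups()
    return Locus(seqid, int(start), int(end))


locus = parse_locus("CMiso1.1chr03:20561682..20562767")
print(locus.seqid, locus.length)  # CMiso1.1chr03 1086
```

Under that assumption the span is 1086 bp, a multiple of 3 (362 codons).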


Homology
GenBank top hits (e value, % identity, alignment)
KYP36562.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] (e value: 1.5e-157; % identity: 74.52)
Query:  VDSGCTIHVSNTMQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIFIG
        +DSGCT HVSNTMQGF   +T +PNE+F+FMGNRVKVPVEAVGTYRL L+TGHHL+L +T YVPS SRNL+SLSKLD  GY F FGN CFSLFK+N  IG
Subjt:  VDSGCTIHVSNTMQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIFIG

Query:  SDILCDDLYKLKLDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFTDLGIYVDCIKGKQTKHIVNKEATRSSHLLE
        + ILCD LYKL LD ++ E+LLTLHHN+GTKRS  NE SA+LWH+RLGHIS+ER++RLIKNEIL +LDFTDL I VDCIKGKQTKH   K ATRS+ LLE
Subjt:  SDILCDDLYKLKLDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFTDLGIYVDCIKGKQTKHIVNKEATRSSHLLE

Query:  IIHTHICGPFDVPSFGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYYGKC------PGPFAKFLESHGICAQY
        I+HT ICGPFDV SFG E YFITFIDD+SRYGY+YLLHEKSQA+D L++++NEVERQLD+KVK++RSDRGGEYYG+       PGPFAK L+  GICAQY
Subjt:  IIHTHICGPFDVPSFGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYYGKC------PGPFAKFLESHGICAQY

Query:  TMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPKTHFELWT
        TMPGTPQQNGV+ERRNRTLM+MVRSML NS+LP+ LWMYAL+TA YLLNRVPSK+V KT FELWT
Subjt:  TMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPKTHFELWT

KYP65984.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] (e value: 1.3e-153; % identity: 76.2)
Query:  MQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIFIGSDILCDDLYKLK
        MQGFL TRTT PNE+F+FMGNRVKVPVEAVGTYRL LDTGHHL+LF+T YVPS SRNL+SLSKLD +GY  KFGN CFSL+K    IGS ILCD LYKL 
Subjt:  MQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIFIGSDILCDDLYKLK

Query:  LDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFTDLGIYVDCIKGKQTKHIVNKEATRSSHLLEIIHTHICGPFDV
        LDN+FAE+LLTLHHN+GTKR   NE  AYLWHKRLGH+SKER++RL+KNEIL DLDFTDL + VDCIKGKQTKH   K ATRS+ LLEIIHT ICGPFDV
Subjt:  LDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFTDLGIYVDCIKGKQTKHIVNKEATRSSHLLEIIHTHICGPFDV

Query:  PSFGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYYGKC------PGPFAKFLESHGICAQYTMPGTPQQNGVA
         SF  E YFITFIDD+SRYGY+YLLH+KSQAI+ L+++I EVERQLD KVKI+RSDRGGEYYG+       PGPFAKFLE  GICAQYTMPGTPQQNGV+
Subjt:  PSFGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYYGKC------PGPFAKFLESHGICAQYTMPGTPQQNGVA

Query:  ERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPKTHFELWT
        ER NRTL++MVRSML NSSL +SLW YAL++A YLLNRVPSK+VPKT FELWT
Subjt:  ERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPKTHFELWT

RZC09906.1 B2 protein isoform D [Glycine soja] (e value: 6.2e-151; % identity: 75.21)
Query:  GFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIFIGSDILCDDLYKLKLD
        GFL  +T +PN++F+FMGNRVK PVEAVGTYRL LDTGHHL+L +T YVPS SRNL+SLSKLD +GY F FGN CFSLFK N  IG+ +LCD LYKLKLD
Subjt:  GFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIFIGSDILCDDLYKLKLD

Query:  NIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFTDLGIYVDCIKGKQTKHIVNKEATRSSHLLEIIHTHICGPFDVPS
         ++ E++LTLHHNVGTKRS  NE SA+LWHKRLGHIS+ERI+RLIKNEIL DLDFTDL I VDCIKGKQTKH   K ATRS+ LLEI+HT ICGPFDV S
Subjt:  NIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFTDLGIYVDCIKGKQTKHIVNKEATRSSHLLEIIHTHICGPFDVPS

Query:  FGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYY------GKCPGPFAKFLESHGICAQYTMPGTPQQNGVAER
        FG E YFITFIDD+SRYGY+YLLHEKSQA++ L++++NEVERQLDRKVKI+RSDR GEYY      G+  GPFAK L+  GICAQYTMPGT QQNGV+ER
Subjt:  FGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYY------GKCPGPFAKFLESHGICAQYTMPGTPQQNGVAER

Query:  RNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPKTHFELWT
        RNRTLM+MVRSMLINS+LPVSLWMYAL+TA YLLNRVPSK++PKT FELWT
Subjt:  RNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPKTHFELWT

RZC12927.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform A [Glycine soja] (e value: 6.0e-162; % identity: 76.71)
Query:  VDSGCTIHVSNTMQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIFIG
        +DSGCT HVSNTMQGFL  +T +PNE+F+FMGNRVK PVEAVGTYRL LDTGHHL+L +T YVPS SRNL+SLSKLD +GY F FGN CFSLFK N  IG
Subjt:  VDSGCTIHVSNTMQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIFIG

Query:  SDILCDDLYKLKLDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFTDLGIYVDCIKGKQTKHIVNKEATRSSHLLE
        + +LCD LYKLKLD ++ E++LTLHHNVGTKRS  NE SA+LWHKRLGHIS+ERI+RLIKNEIL DLDFTDL I VDCIKGKQTKH   K ATRS+ LLE
Subjt:  SDILCDDLYKLKLDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFTDLGIYVDCIKGKQTKHIVNKEATRSSHLLE

Query:  IIHTHICGPFDVPSFGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYYGKC------PGPFAKFLESHGICAQY
        I+HT ICGPFDV SFG E YFITFIDD+SRYGY+YLLHEKSQA++ L++++NEVERQLDRKVKI+RSDRGGEYYG+       PGPFAK L+  GICAQY
Subjt:  IIHTHICGPFDVPSFGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYYGKC------PGPFAKFLESHGICAQY

Query:  TMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPKTHFELWT
        TMPGTPQQNGV+ERRNRTLM+MVRSMLINS+LPVSLWMYAL+TA YLLN VPSK+VPKT FELWT
Subjt:  TMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPKTHFELWT

RZC25410.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja] (e value: 6.6e-161; % identity: 76.44)
Query:  VDSGCTIHVSNTMQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIFIG
        +DSGCT HVSNTMQGFL  +T +PNE+F+FMGNRVK PVEAVGTYRL LDTGHHL+L +T YVPS SRNL+SLSKLD +GY F FGN CFSLFK N  IG
Subjt:  VDSGCTIHVSNTMQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIFIG

Query:  SDILCDDLYKLKLDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFTDLGIYVDCIKGKQTKHIVNKEATRSSHLLE
        + +LCD LYKLKLD ++ E++LTLHHNVGTKRS  NE SA+LWHKRLGHIS ERI+RLIKNEIL DLDFTDL I VDCIKGKQTKH   K ATRS+ LLE
Subjt:  SDILCDDLYKLKLDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFTDLGIYVDCIKGKQTKHIVNKEATRSSHLLE

Query:  IIHTHICGPFDVPSFGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYY------GKCPGPFAKFLESHGICAQY
        I+HT ICGPFDV SFG E YFITFIDD+SRYGY+YLLHEKSQA++ L++++NEVERQLDRKVKI+RSDRGGEYY      G+ P PFAK L+  GICAQY
Subjt:  IIHTHICGPFDVPSFGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYY------GKCPGPFAKFLESHGICAQY

Query:  TMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPKTHFELWT
        TMPGTPQQNGV+ERRN+TLM+MVRSMLINS+LPVSLWMYAL+TA YLLNRVPSK+VPKT FELWT
Subjt:  TMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPKTHFELWT
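
The alignment blocks above carry a midline between the Query and Subjt rows. The sketch below estimates percent identity for a single block from that midline, assuming the usual BLAST convention: a letter marks an identical residue, `+` a conservative substitution, and a blank a mismatch or gap. `midline_identity` is a hypothetical helper, and a per-block figure will generally differ from the whole-alignment % identity reported for each hit.

```python
def midline_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    Both strings must cover the same columns, so pad the midline with
    trailing spaces if the page stripped them.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("empty alignment block")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Toy block: 4 identical columns, 1 mismatch, 1 conservative substitution.
print(round(midline_identity("MQGFLM", "MQGF +"), 2))  # 66.67
```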

TrEMBL top hits (e value, % identity, alignment)
A0A151R237 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 7.3e-158; % identity: 74.52)
Query:  VDSGCTIHVSNTMQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIFIG
        +DSGCT HVSNTMQGF   +T +PNE+F+FMGNRVKVPVEAVGTYRL L+TGHHL+L +T YVPS SRNL+SLSKLD  GY F FGN CFSLFK+N  IG
Subjt:  VDSGCTIHVSNTMQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIFIG

Query:  SDILCDDLYKLKLDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFTDLGIYVDCIKGKQTKHIVNKEATRSSHLLE
        + ILCD LYKL LD ++ E+LLTLHHN+GTKRS  NE SA+LWH+RLGHIS+ER++RLIKNEIL +LDFTDL I VDCIKGKQTKH   K ATRS+ LLE
Subjt:  SDILCDDLYKLKLDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFTDLGIYVDCIKGKQTKHIVNKEATRSSHLLE

Query:  IIHTHICGPFDVPSFGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYYGKC------PGPFAKFLESHGICAQY
        I+HT ICGPFDV SFG E YFITFIDD+SRYGY+YLLHEKSQA+D L++++NEVERQLD+KVK++RSDRGGEYYG+       PGPFAK L+  GICAQY
Subjt:  IIHTHICGPFDVPSFGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYYGKC------PGPFAKFLESHGICAQY

Query:  TMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPKTHFELWT
        TMPGTPQQNGV+ERRNRTLM+MVRSML NS+LP+ LWMYAL+TA YLLNRVPSK+V KT FELWT
Subjt:  TMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPKTHFELWT

A0A151TG02 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 6.4e-154; % identity: 76.2)
Query:  MQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIFIGSDILCDDLYKLK
        MQGFL TRTT PNE+F+FMGNRVKVPVEAVGTYRL LDTGHHL+LF+T YVPS SRNL+SLSKLD +GY  KFGN CFSL+K    IGS ILCD LYKL 
Subjt:  MQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIFIGSDILCDDLYKLK

Query:  LDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFTDLGIYVDCIKGKQTKHIVNKEATRSSHLLEIIHTHICGPFDV
        LDN+FAE+LLTLHHN+GTKR   NE  AYLWHKRLGH+SKER++RL+KNEIL DLDFTDL + VDCIKGKQTKH   K ATRS+ LLEIIHT ICGPFDV
Subjt:  LDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFTDLGIYVDCIKGKQTKHIVNKEATRSSHLLEIIHTHICGPFDV

Query:  PSFGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYYGKC------PGPFAKFLESHGICAQYTMPGTPQQNGVA
         SF  E YFITFIDD+SRYGY+YLLH+KSQAI+ L+++I EVERQLD KVKI+RSDRGGEYYG+       PGPFAKFLE  GICAQYTMPGTPQQNGV+
Subjt:  PSFGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYYGKC------PGPFAKFLESHGICAQYTMPGTPQQNGVA

Query:  ERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPKTHFELWT
        ER NRTL++MVRSML NSSL +SLW YAL++A YLLNRVPSK+VPKT FELWT
Subjt:  ERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPKTHFELWT

A0A445KGB1 B2 protein isoform D (e value: 3.0e-151; % identity: 75.21)
Query:  GFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIFIGSDILCDDLYKLKLD
        GFL  +T +PN++F+FMGNRVK PVEAVGTYRL LDTGHHL+L +T YVPS SRNL+SLSKLD +GY F FGN CFSLFK N  IG+ +LCD LYKLKLD
Subjt:  GFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIFIGSDILCDDLYKLKLD

Query:  NIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFTDLGIYVDCIKGKQTKHIVNKEATRSSHLLEIIHTHICGPFDVPS
         ++ E++LTLHHNVGTKRS  NE SA+LWHKRLGHIS+ERI+RLIKNEIL DLDFTDL I VDCIKGKQTKH   K ATRS+ LLEI+HT ICGPFDV S
Subjt:  NIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFTDLGIYVDCIKGKQTKHIVNKEATRSSHLLEIIHTHICGPFDVPS

Query:  FGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYY------GKCPGPFAKFLESHGICAQYTMPGTPQQNGVAER
        FG E YFITFIDD+SRYGY+YLLHEKSQA++ L++++NEVERQLDRKVKI+RSDR GEYY      G+  GPFAK L+  GICAQYTMPGT QQNGV+ER
Subjt:  FGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYY------GKCPGPFAKFLESHGICAQYTMPGTPQQNGVAER

Query:  RNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPKTHFELWT
        RNRTLM+MVRSMLINS+LPVSLWMYAL+TA YLLNRVPSK++PKT FELWT
Subjt:  RNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPKTHFELWT

A0A445KPR8 Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform A (e value: 2.9e-162; % identity: 76.71)
Query:  VDSGCTIHVSNTMQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIFIG
        +DSGCT HVSNTMQGFL  +T +PNE+F+FMGNRVK PVEAVGTYRL LDTGHHL+L +T YVPS SRNL+SLSKLD +GY F FGN CFSLFK N  IG
Subjt:  VDSGCTIHVSNTMQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIFIG

Query:  SDILCDDLYKLKLDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFTDLGIYVDCIKGKQTKHIVNKEATRSSHLLE
        + +LCD LYKLKLD ++ E++LTLHHNVGTKRS  NE SA+LWHKRLGHIS+ERI+RLIKNEIL DLDFTDL I VDCIKGKQTKH   K ATRS+ LLE
Subjt:  SDILCDDLYKLKLDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFTDLGIYVDCIKGKQTKHIVNKEATRSSHLLE

Query:  IIHTHICGPFDVPSFGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYYGKC------PGPFAKFLESHGICAQY
        I+HT ICGPFDV SFG E YFITFIDD+SRYGY+YLLHEKSQA++ L++++NEVERQLDRKVKI+RSDRGGEYYG+       PGPFAK L+  GICAQY
Subjt:  IIHTHICGPFDVPSFGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYYGKC------PGPFAKFLESHGICAQY

Query:  TMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPKTHFELWT
        TMPGTPQQNGV+ERRNRTLM+MVRSMLINS+LPVSLWMYAL+TA YLLN VPSK+VPKT FELWT
Subjt:  TMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPKTHFELWT

A0A445LQ30 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 3.2e-161; % identity: 76.44)
Query:  VDSGCTIHVSNTMQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIFIG
        +DSGCT HVSNTMQGFL  +T +PNE+F+FMGNRVK PVEAVGTYRL LDTGHHL+L +T YVPS SRNL+SLSKLD +GY F FGN CFSLFK N  IG
Subjt:  VDSGCTIHVSNTMQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIFIG

Query:  SDILCDDLYKLKLDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFTDLGIYVDCIKGKQTKHIVNKEATRSSHLLE
        + +LCD LYKLKLD ++ E++LTLHHNVGTKRS  NE SA+LWHKRLGHIS ERI+RLIKNEIL DLDFTDL I VDCIKGKQTKH   K ATRS+ LLE
Subjt:  SDILCDDLYKLKLDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFTDLGIYVDCIKGKQTKHIVNKEATRSSHLLE

Query:  IIHTHICGPFDVPSFGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYY------GKCPGPFAKFLESHGICAQY
        I+HT ICGPFDV SFG E YFITFIDD+SRYGY+YLLHEKSQA++ L++++NEVERQLDRKVKI+RSDRGGEYY      G+ P PFAK L+  GICAQY
Subjt:  IIHTHICGPFDVPSFGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYY------GKCPGPFAKFLESHGICAQY

Query:  TMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPKTHFELWT
        TMPGTPQQNGV+ERRN+TLM+MVRSMLINS+LPVSLWMYAL+TA YLLNRVPSK+VPKT FELWT
Subjt:  TMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPKTHFELWT

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 4.6e-32; % identity: 27.64)
Query:  VVDSGCTIHVSNTMQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIFI
        V+DSG + H+ N    +  +    P  +         +     G  RL  D  H + L D  +    + NL+S+ +L  +G   +F     ++ K  + +
Subjt:  VVDSGCTIHVSNTMQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIFI

Query:  GSDILCDDLYKLKLDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQD---LDFTDLG--IYVDCIKGKQTKHIVN--KEA
          +     L  + + N  A S+   H N           +  LWH+R GHIS  ++  + +  +  D   L+  +L   I   C+ GKQ +      K+ 
Subjt:  GSDILCDDLYKLKLDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQD---LDFTDLG--IYVDCIKGKQTKHIVN--KEA

Query:  TRSSHLLEIIHTHICGPFDVPSFGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYYGKCPGPFAKFLESHGICA
        T     L ++H+ +CGP    +   + YF+ F+D F+ Y   YL+  KS    + + F+ + E   + KV  L  D G EY         +F    GI  
Subjt:  TRSSHLLEIIHTHICGPFDVPSFGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYYGKCPGPFAKFLESHGICA

Query:  QYTMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSV---PKTHFELW
          T+P TPQ NGV+ER  RT+    R+M+  + L  S W  A+ TA YL+NR+PS+++    KT +E+W
Subjt:  QYTMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSV---PKTHFELW

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 7.5e-51; % identity: 35.63)
Query:  VVDSGCTIHVSNTMQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIFI
        VVD+  + H +      L  R    +   + MGN     +  +G   +  + G  L L D  +VP    NLIS   LD  GY   F N  + L K ++ I
Subjt:  VVDSGCTIHVSNTMQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIFI

Query:  GSDILCDDLYKLKLDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFTDLGIYVDCIKGKQTKHIVNKEATRSSHLL
           +    LY+   +              G   +  +E S  LWHKR+GH+S++ ++ L K  ++     T +     C+ GKQ +      + R  ++L
Subjt:  GSDILCDDLYKLKLDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFTDLGIYVDCIKGKQTKHIVNKEATRSSHLL

Query:  EIIHTHICGPFDVPSFGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYYGKCPGPFAKFLESHGICAQYTMPGT
        +++++ +CGP ++ S GG  YF+TFIDD SR  ++Y+L  K Q   V + F   VER+  RK+K LRSD GGEY  +    F ++  SHGI  + T+PGT
Subjt:  EIIHTHICGPFDVPSFGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYYGKCPGPFAKFLESHGICAQYTMPGT

Query:  PQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPS
        PQ NGVAER NRT++  VRSML  + LP S W  A++TA YL+NR PS
Subjt:  PQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPS

Q12501 Transposon Ty2-OR2 Gag-Pol polyprotein (e value: 2.4e-25; % identity: 25.82)
Query:  MVVDSGCTIHVSNTMQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIF
        +++DSG +  +  +   +L   T N +E  I    +  +P+ A+G        G   ++    + P+ + +L+SLS+L        F  +          
Subjt:  MVVDSGCTIHVSNTMQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIF

Query:  IGSDILCDDLYKLKLDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEIL-----QDLDFTDLGIY--VDCIKGKQTKHIVNK-
        +   +   D Y L    +    +  L  N   K    N+    L H+ LGH +   I++ +K   +      D+++++   Y   DC+ GK TKH   K 
Subjt:  IGSDILCDDLYKLKLDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEIL-----QDLDFTDLGIY--VDCIKGKQTKHIVNK-

Query:  ---EATRSSHLLEIIHTHICGPFDVPSFGGEMYFITFIDDFSRYGYIYLLHEKSQ--AIDVLKVFINEVERQLDRKVKILRSDRGGEYYGKCPGPFAKFL
           +   S    + +HT I GP          YFI+F D+ +R+ ++Y LH++ +   ++V    +  ++ Q + +V +++ DRG EY  K      KF 
Subjt:  ---EATRSSHLLEIIHTHICGPFDVPSFGGEMYFITFIDDFSRYGYIYLLHEKSQ--AIDVLKVFINEVERQLDRKVKILRSDRGGEYYGKCPGPFAKFL

Query:  ESHGICAQYTMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPKT
         + GI A YT     + +GVAER NRTL+N  R++L  S LP  LW  A+  +  + N + S    K+
Subjt:  ESHGICAQYTMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPKT

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 5.3e-28; % identity: 25.71)
Query:  VVDSGCTIHVSNTMQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKL-DTSGYYFKFGNDCFSLFKQNIF
        ++DSG T H+++      + +     +  + + +   +P+   G+  L+  +   LNL +  YVP+  +NLIS+ +L + +G   +F    F +  +++ 
Subjt:  VVDSGCTIHVSNTMQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKL-DTSGYYFKFGNDCFSLFKQNIF

Query:  IGSDIL----CDDLYKLKLDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFTDLGIYV-DCIKGKQTKHIVNKEAT
         G  +L     D+LY+  + +    SL           S +++++   WH RLGH +   +  +I N  L  L+ +   +   DC+  K  K   ++   
Subjt:  IGSDIL----CDDLYKLKLDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFTDLGIYV-DCIKGKQTKHIVNKEAT

Query:  RSSHLLEIIHTHICGPFDVPSFGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYYGKCPGPFAKFLESHGICAQ
         S+  LE I++ +     + S     Y++ F+D F+RY ++Y L +KSQ  +    F N +E +   ++    SD GGE+         ++   HGI   
Subjt:  RSSHLLEIIHTHICGPFDVPSFGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYYGKCPGPFAKFLESHGICAQ

Query:  YTMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPS
         + P TP+ NG++ER++R ++    ++L ++S+P + W YA   A YL+NR+P+
Subjt:  YTMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 3.1e-28; % identity: 27.4)
Query:  VVDSGCTIHVSNTMQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKL-DTSGYYFKFGNDCFSLFKQNIF
        ++DSG T H+++        +     +  + + +   +P+   G+  L   +   L+L    YVP+  +NLIS+ +L +T+    +F    F +  +++ 
Subjt:  VVDSGCTIHVSNTMQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKL-DTSGYYFKFGNDCFSLFKQNIF

Query:  IGSDIL----CDDLYKLKLDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFT-DLGIYVDCIKGKQTKHIVNKEAT
         G  +L     D+LY+  + +  A S+           S  ++++   WH RLGH S   +  +I N  L  L+ +  L    DC   K  K   +    
Subjt:  IGSDIL----CDDLYKLKLDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFT-DLGIYVDCIKGKQTKHIVNKEAT

Query:  RSSHLLEIIHTHICGPFDVPSFGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYYGKCPGPFAKFLESHGICAQ
         SS  LE I++ +     + S     Y++ F+D F+RY ++Y L +KSQ  D   +F + VE +   ++  L SD GGE+          +L  HGI   
Subjt:  RSSHLLEIIHTHICGPFDVPSFGGEMYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYYGKCPGPFAKFLESHGICAQ

Query:  YTMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPS
         + P TP+ NG++ER++R ++ M  ++L ++S+P + W YA   A YL+NR+P+
Subjt:  YTMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPS

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGTGGTTGATTCTGGTTGTACCATTCATGTTTCCAATACGATGCAGGGATTCCTTATGACCCGAACCACAAACCCAAATGAGAGATTCATTTTTATGGGAAACAGAGT
CAAAGTTCCAGTTGAAGCTGTGGGAACCTATCGTTTAACTTTAGATACTGGACATCATTTAAACCTTTTTGATACCTTTTATGTTCCTTCTTTTTCTCGTAATTTGATTT
CCTTGTCAAAACTTGATACTTCAGGTTATTACTTTAAATTTGGGAATGATTGTTTTAGTTTATTCAAACAGAACATTTTTATTGGTTCTGATATTCTTTGTGATGACTTA
TATAAATTAAAGCTTGATAATATTTTTGCTGAGAGTTTGTTAACCTTGCATCATAATGTTGGTACTAAACGTAGTCAAACTAATGAATCGTCAGCTTACTTGTGGCATAA
ACGTTTAGGTCACATATCCAAAGAAAGAATTAAAAGATTAATAAAGAATGAAATTCTTCAAGATTTGGATTTTACTGACCTTGGAATTTATGTGGATTGTATTAAAGGAA
AACAAACAAAACACATAGTTAATAAAGAAGCCACAAGAAGCTCACATCTCCTTGAAATTATACACACTCATATTTGTGGGCCTTTTGATGTTCCATCTTTTGGTGGAGAA
ATGTATTTTATCACCTTTATTGATGATTTCTCACGTTATGGTTATATCTATTTATTGCATGAGAAATCTCAAGCAATAGATGTCTTAAAAGTATTTATAAATGAAGTTGA
AAGGCAATTAGATAGAAAGGTGAAAATCTTAAGATCTGATAGAGGTGGTGAGTATTATGGAAAATGCCCCGGTCCATTCGCTAAATTCCTAGAAAGTCATGGCATATGTG
CTCAATACACAATGCCAGGAACACCACAACAAAATGGTGTTGCAGAAAGGCGAAATCGTACATTAATGAATATGGTTAGAAGCATGTTAATTAATTCATCTTTACCTGTG
TCCTTGTGGATGTATGCATTAAGAACAGCTCAATATTTATTAAACAGGGTTCCTAGTAAGTCAGTTCCAAAGACACATTTTGAACTGTGGACATGA
mRNA sequence
ATGGTGGTTGATTCTGGTTGTACCATTCATGTTTCCAATACGATGCAGGGATTCCTTATGACCCGAACCACAAACCCAAATGAGAGATTCATTTTTATGGGAAACAGAGT
CAAAGTTCCAGTTGAAGCTGTGGGAACCTATCGTTTAACTTTAGATACTGGACATCATTTAAACCTTTTTGATACCTTTTATGTTCCTTCTTTTTCTCGTAATTTGATTT
CCTTGTCAAAACTTGATACTTCAGGTTATTACTTTAAATTTGGGAATGATTGTTTTAGTTTATTCAAACAGAACATTTTTATTGGTTCTGATATTCTTTGTGATGACTTA
TATAAATTAAAGCTTGATAATATTTTTGCTGAGAGTTTGTTAACCTTGCATCATAATGTTGGTACTAAACGTAGTCAAACTAATGAATCGTCAGCTTACTTGTGGCATAA
ACGTTTAGGTCACATATCCAAAGAAAGAATTAAAAGATTAATAAAGAATGAAATTCTTCAAGATTTGGATTTTACTGACCTTGGAATTTATGTGGATTGTATTAAAGGAA
AACAAACAAAACACATAGTTAATAAAGAAGCCACAAGAAGCTCACATCTCCTTGAAATTATACACACTCATATTTGTGGGCCTTTTGATGTTCCATCTTTTGGTGGAGAA
ATGTATTTTATCACCTTTATTGATGATTTCTCACGTTATGGTTATATCTATTTATTGCATGAGAAATCTCAAGCAATAGATGTCTTAAAAGTATTTATAAATGAAGTTGA
AAGGCAATTAGATAGAAAGGTGAAAATCTTAAGATCTGATAGAGGTGGTGAGTATTATGGAAAATGCCCCGGTCCATTCGCTAAATTCCTAGAAAGTCATGGCATATGTG
CTCAATACACAATGCCAGGAACACCACAACAAAATGGTGTTGCAGAAAGGCGAAATCGTACATTAATGAATATGGTTAGAAGCATGTTAATTAATTCATCTTTACCTGTG
TCCTTGTGGATGTATGCATTAAGAACAGCTCAATATTTATTAAACAGGGTTCCTAGTAAGTCAGTTCCAAAGACACATTTTGAACTGTGGACATGA
Protein sequence
MVVDSGCTIHVSNTMQGFLMTRTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTGHHLNLFDTFYVPSFSRNLISLSKLDTSGYYFKFGNDCFSLFKQNIFIGSDILCDDL
YKLKLDNIFAESLLTLHHNVGTKRSQTNESSAYLWHKRLGHISKERIKRLIKNEILQDLDFTDLGIYVDCIKGKQTKHIVNKEATRSSHLLEIIHTHICGPFDVPSFGGE
MYFITFIDDFSRYGYIYLLHEKSQAIDVLKVFINEVERQLDRKVKILRSDRGGEYYGKCPGPFAKFLESHGICAQYTMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPV
SLWMYALRTAQYLLNRVPSKSVPKTHFELWT
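
The protein sequence above should follow from the CDS under the standard genetic code. A minimal sketch of that check, using only the first ten and last five codons of the CDS listed on this page; `translate` is a hypothetical helper written for illustration, not a CuGenDB function, and it assumes the standard (nuclear) codon table applies.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped text
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_codons = "ATGGTGGTTGATTCTGGTTGTACCATTCAT"  # start of the CDS
last_codons = "GAACTGTGGACATGA"                  # end of the CDS, with stop

print(translate(first_codons))  # MVVDSGCTIH
print(translate(last_codons))   # ELWT
```

Passing the full wrapped CDS text to `translate` works the same way, since whitespace is removed before translation.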