; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039219 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039219
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr2:39206200..39214518
RNA-Seq ExpressionLag0039219
SyntenyLag0039219
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025159.1 gag/pol protein [Cucumis melo var. makuwa]2.5e-9866.89Show/hide
Query:  PTSNASQEVKDAYDRWTKANDMTRVYILASLSEDLPKRHEGM--------------------IEGEANVVHSKRKFEKGSSSGTKSIATSSKKTQKKKGN
        PT  ASQ V+D YDRWTKAND  R++ILAS+S+ L K+HE M                    I+ EANVVHSKR+F   SS         S+K QK+K  
Subjt:  PTSNASQEVKDAYDRWTKANDMTRVYILASLSEDLPKRHEGM--------------------IEGEANVVHSKRKFEKGSSSGTKSIATSSKKTQKKKGN

Query:  KGKAPSTAAKSKGKAKAMADKGKCFHCNVDGHLKRNCRKYLAEKK--RKRSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGYINIDRIDRLVKNGLLSK
        K K P+ A + KGKAK +A K K FHCNVD H K NC KYL +KK   + SSF+QL + EMTL+VGTG+VISA+AVG AKLG+IN+DRI RLVKNGLL+K
Subjt:  KGKAPSTAAKSKGKAKAMADKGKCFHCNVDGHLKRNCRKYLAEKK--RKRSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGYINIDRIDRLVKNGLLSK

Query:  LEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLR
        L+DDSLPPCESCLEGKMTKRPFT KGYRAKEPLELIHSDL G MNVKARGG+EYFISFIDDYSRYGYLYLM HKSEALEKFKE+K EVENLL K IK+LR
Subjt:  LEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLR

Query:  SDRGG
        SDRGG
Subjt:  SDRGG

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]4.2e-10165.82Show/hide
Query:  PTSNASQEVKDAYDRWTKANDMTRVYILASLSEDLPKRHEGM--------------------IEGEANVVHSKRKFEKGSSSGTKSIATSSKKTQKKKGN
        PT +ASQ V+DAYDRWTKAND  R++ILAS+S+ L K+HE M                    I+ EANV HSKR+F    S         S+K QK+K  
Subjt:  PTSNASQEVKDAYDRWTKANDMTRVYILASLSEDLPKRHEGM--------------------IEGEANVVHSKRKFEKGSSSGTKSIATSSKKTQKKKGN

Query:  KGKAPSTAAKSKGKAKAMADKGKCFHCNVDGHLKRNCRKYLAEKKRKR-------------SSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGYINIDRI
        KGK P+ A + KGKAK +A K KCFHCNVD H K NC KYL +KK K              SSF+QL D EMTL+VGTG+VISA+AVG AKLG+IN+DRI
Subjt:  KGKAPSTAAKSKGKAKAMADKGKCFHCNVDGHLKRNCRKYLAEKKRKR-------------SSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGYINIDRI

Query:  DRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVE
         RLVKNGLL+KL+D SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDL GPMNVKARGG+EYFISFIDDYSRYGYLYLM HKSEALEKFKE+K EVE
Subjt:  DRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVE

Query:  NLLGKSIKVLRSDRGG
        NLL K IK+LRSDRGG
Subjt:  NLLGKSIKVLRSDRGG

TYK02840.1 gag/pol protein [Cucumis melo var. makuwa]4.2e-10165.82Show/hide
Query:  PTSNASQEVKDAYDRWTKANDMTRVYILASLSEDLPKRHEGM--------------------IEGEANVVHSKRKFEKGSSSGTKSIATSSKKTQKKKGN
        PT +ASQ V+DAYDRWTKAND  R++ILAS+S+ L K+HE M                    I+ EANV HSKR+F    S         S+K QK+K  
Subjt:  PTSNASQEVKDAYDRWTKANDMTRVYILASLSEDLPKRHEGM--------------------IEGEANVVHSKRKFEKGSSSGTKSIATSSKKTQKKKGN

Query:  KGKAPSTAAKSKGKAKAMADKGKCFHCNVDGHLKRNCRKYLAEKKRKR-------------SSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGYINIDRI
        KGK P+ A + KGKAK +A K KCFHCNVD H K NC KYL +KK K              SSF+QL D EMTL+VGTG+VISA+AVG AKLG+IN+DRI
Subjt:  KGKAPSTAAKSKGKAKAMADKGKCFHCNVDGHLKRNCRKYLAEKKRKR-------------SSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGYINIDRI

Query:  DRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVE
         RLVKNGLL+KL+D SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDL GPMNVKARGG+EYFISFIDDYSRYGYLYLM HKSEALEKFKE+K EVE
Subjt:  DRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVE

Query:  NLLGKSIKVLRSDRGG
        NLL K IK+LRSDRGG
Subjt:  NLLGKSIKVLRSDRGG

TYK04171.1 gag/pol protein [Cucumis melo var. makuwa]4.2e-10165.82Show/hide
Query:  PTSNASQEVKDAYDRWTKANDMTRVYILASLSEDLPKRHEGM--------------------IEGEANVVHSKRKFEKGSSSGTKSIATSSKKTQKKKGN
        PT +ASQ V+DAYDRWTKAND  R++ILAS+S+ L K+HE M                    I+ EANV HSKR+F    S         S+K QK+K  
Subjt:  PTSNASQEVKDAYDRWTKANDMTRVYILASLSEDLPKRHEGM--------------------IEGEANVVHSKRKFEKGSSSGTKSIATSSKKTQKKKGN

Query:  KGKAPSTAAKSKGKAKAMADKGKCFHCNVDGHLKRNCRKYLAEKKRKR-------------SSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGYINIDRI
        KGK P+ A + KGKAK +A K KCFHCNVD H K NC KYL +KK K              SSF+QL D EMTL+VGTG+VISA+AVG AKLG+IN+DRI
Subjt:  KGKAPSTAAKSKGKAKAMADKGKCFHCNVDGHLKRNCRKYLAEKKRKR-------------SSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGYINIDRI

Query:  DRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVE
         RLVKNGLL+KL+D SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDL GPMNVKARGG+EYFISFIDDYSRYGYLYLM HKSEALEKFKE+K EVE
Subjt:  DRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVE

Query:  NLLGKSIKVLRSDRGG
        NLL K IK+LRSDRGG
Subjt:  NLLGKSIKVLRSDRGG

TYK16041.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-10166.03Show/hide
Query:  PTSNASQEVKDAYDRWTKANDMTRVYILASLSEDLPKRHEGM--------------------IEGEANVVHSKRKFEKGSSSGTKSIATSSKKTQKKKGN
        PT +ASQ V+DAYDRWTKAND  R++ILAS+S+ L K+HE M                    I+ EANV HSKR+F    S         S+K QK+K  
Subjt:  PTSNASQEVKDAYDRWTKANDMTRVYILASLSEDLPKRHEGM--------------------IEGEANVVHSKRKFEKGSSSGTKSIATSSKKTQKKKGN

Query:  KGKAPSTAAKSKGKAKAMADKGKCFHCNVDGHLKRNCRKYLAEKKRKR-------------SSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGYINIDRI
        KGK P+ A + KGKAK +A K KCFHCNVD H K NC KYL +KK K              SSF+QL D EMTL+VGTG+VISA+AVG AKLG+IN+DRI
Subjt:  KGKAPSTAAKSKGKAKAMADKGKCFHCNVDGHLKRNCRKYLAEKKRKR-------------SSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGYINIDRI

Query:  DRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVE
         RLVKNGL++KLEDDSLPPCESCLEGKMTKRPF GKGYRAKEPLELIHSDL GPMNVKARGG+EYFISFIDDYSRYGYLYLM HKSEALEKFKE+KAEVE
Subjt:  DRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVE

Query:  NLLGKSIKVLRSDRG
        NLL K IK+LRSDRG
Subjt:  NLLGKSIKVLRSDRG

TrEMBL top hitse value%identityAlignment
A0A5A7SIN2 Gag/pol protein1.2e-9866.89Show/hide
Query:  PTSNASQEVKDAYDRWTKANDMTRVYILASLSEDLPKRHEGM--------------------IEGEANVVHSKRKFEKGSSSGTKSIATSSKKTQKKKGN
        PT  ASQ V+D YDRWTKAND  R++ILAS+S+ L K+HE M                    I+ EANVVHSKR+F   SS         S+K QK+K  
Subjt:  PTSNASQEVKDAYDRWTKANDMTRVYILASLSEDLPKRHEGM--------------------IEGEANVVHSKRKFEKGSSSGTKSIATSSKKTQKKKGN

Query:  KGKAPSTAAKSKGKAKAMADKGKCFHCNVDGHLKRNCRKYLAEKK--RKRSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGYINIDRIDRLVKNGLLSK
        K K P+ A + KGKAK +A K K FHCNVD H K NC KYL +KK   + SSF+QL + EMTL+VGTG+VISA+AVG AKLG+IN+DRI RLVKNGLL+K
Subjt:  KGKAPSTAAKSKGKAKAMADKGKCFHCNVDGHLKRNCRKYLAEKK--RKRSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGYINIDRIDRLVKNGLLSK

Query:  LEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLR
        L+DDSLPPCESCLEGKMTKRPFT KGYRAKEPLELIHSDL G MNVKARGG+EYFISFIDDYSRYGYLYLM HKSEALEKFKE+K EVENLL K IK+LR
Subjt:  LEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKSIKVLR

Query:  SDRGG
        SDRGG
Subjt:  SDRGG

A0A5A7UYE8 Gag/pol protein2.0e-10165.82Show/hide
Query:  PTSNASQEVKDAYDRWTKANDMTRVYILASLSEDLPKRHEGM--------------------IEGEANVVHSKRKFEKGSSSGTKSIATSSKKTQKKKGN
        PT +ASQ V+DAYDRWTKAND  R++ILAS+S+ L K+HE M                    I+ EANV HSKR+F    S         S+K QK+K  
Subjt:  PTSNASQEVKDAYDRWTKANDMTRVYILASLSEDLPKRHEGM--------------------IEGEANVVHSKRKFEKGSSSGTKSIATSSKKTQKKKGN

Query:  KGKAPSTAAKSKGKAKAMADKGKCFHCNVDGHLKRNCRKYLAEKKRKR-------------SSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGYINIDRI
        KGK P+ A + KGKAK +A K KCFHCNVD H K NC KYL +KK K              SSF+QL D EMTL+VGTG+VISA+AVG AKLG+IN+DRI
Subjt:  KGKAPSTAAKSKGKAKAMADKGKCFHCNVDGHLKRNCRKYLAEKKRKR-------------SSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGYINIDRI

Query:  DRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVE
         RLVKNGLL+KL+D SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDL GPMNVKARGG+EYFISFIDDYSRYGYLYLM HKSEALEKFKE+K EVE
Subjt:  DRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVE

Query:  NLLGKSIKVLRSDRGG
        NLL K IK+LRSDRGG
Subjt:  NLLGKSIKVLRSDRGG

A0A5D3BUN8 Gag/pol protein2.0e-10165.82Show/hide
Query:  PTSNASQEVKDAYDRWTKANDMTRVYILASLSEDLPKRHEGM--------------------IEGEANVVHSKRKFEKGSSSGTKSIATSSKKTQKKKGN
        PT +ASQ V+DAYDRWTKAND  R++ILAS+S+ L K+HE M                    I+ EANV HSKR+F    S         S+K QK+K  
Subjt:  PTSNASQEVKDAYDRWTKANDMTRVYILASLSEDLPKRHEGM--------------------IEGEANVVHSKRKFEKGSSSGTKSIATSSKKTQKKKGN

Query:  KGKAPSTAAKSKGKAKAMADKGKCFHCNVDGHLKRNCRKYLAEKKRKR-------------SSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGYINIDRI
        KGK P+ A + KGKAK +A K KCFHCNVD H K NC KYL +KK K              SSF+QL D EMTL+VGTG+VISA+AVG AKLG+IN+DRI
Subjt:  KGKAPSTAAKSKGKAKAMADKGKCFHCNVDGHLKRNCRKYLAEKKRKR-------------SSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGYINIDRI

Query:  DRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVE
         RLVKNGLL+KL+D SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDL GPMNVKARGG+EYFISFIDDYSRYGYLYLM HKSEALEKFKE+K EVE
Subjt:  DRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVE

Query:  NLLGKSIKVLRSDRGG
        NLL K IK+LRSDRGG
Subjt:  NLLGKSIKVLRSDRGG

A0A5D3BWT8 Gag/pol protein2.0e-10165.82Show/hide
Query:  PTSNASQEVKDAYDRWTKANDMTRVYILASLSEDLPKRHEGM--------------------IEGEANVVHSKRKFEKGSSSGTKSIATSSKKTQKKKGN
        PT +ASQ V+DAYDRWTKAND  R++ILAS+S+ L K+HE M                    I+ EANV HSKR+F    S         S+K QK+K  
Subjt:  PTSNASQEVKDAYDRWTKANDMTRVYILASLSEDLPKRHEGM--------------------IEGEANVVHSKRKFEKGSSSGTKSIATSSKKTQKKKGN

Query:  KGKAPSTAAKSKGKAKAMADKGKCFHCNVDGHLKRNCRKYLAEKKRKR-------------SSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGYINIDRI
        KGK P+ A + KGKAK +A K KCFHCNVD H K NC KYL +KK K              SSF+QL D EMTL+VGTG+VISA+AVG AKLG+IN+DRI
Subjt:  KGKAPSTAAKSKGKAKAMADKGKCFHCNVDGHLKRNCRKYLAEKKRKR-------------SSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGYINIDRI

Query:  DRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVE
         RLVKNGLL+KL+D SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDL GPMNVKARGG+EYFISFIDDYSRYGYLYLM HKSEALEKFKE+K EVE
Subjt:  DRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVE

Query:  NLLGKSIKVLRSDRGG
        NLL K IK+LRSDRGG
Subjt:  NLLGKSIKVLRSDRGG

A0A5D3CVV9 Gag/pol protein6.9e-10266.03Show/hide
Query:  PTSNASQEVKDAYDRWTKANDMTRVYILASLSEDLPKRHEGM--------------------IEGEANVVHSKRKFEKGSSSGTKSIATSSKKTQKKKGN
        PT +ASQ V+DAYDRWTKAND  R++ILAS+S+ L K+HE M                    I+ EANV HSKR+F    S         S+K QK+K  
Subjt:  PTSNASQEVKDAYDRWTKANDMTRVYILASLSEDLPKRHEGM--------------------IEGEANVVHSKRKFEKGSSSGTKSIATSSKKTQKKKGN

Query:  KGKAPSTAAKSKGKAKAMADKGKCFHCNVDGHLKRNCRKYLAEKKRKR-------------SSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGYINIDRI
        KGK P+ A + KGKAK +A K KCFHCNVD H K NC KYL +KK K              SSF+QL D EMTL+VGTG+VISA+AVG AKLG+IN+DRI
Subjt:  KGKAPSTAAKSKGKAKAMADKGKCFHCNVDGHLKRNCRKYLAEKKRKR-------------SSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGYINIDRI

Query:  DRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVE
         RLVKNGL++KLEDDSLPPCESCLEGKMTKRPF GKGYRAKEPLELIHSDL GPMNVKARGG+EYFISFIDDYSRYGYLYLM HKSEALEKFKE+KAEVE
Subjt:  DRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVE

Query:  NLLGKSIKVLRSDRG
        NLL K IK+LRSDRG
Subjt:  NLLGKSIKVLRSDRG

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.3e-0931.71Show/hide
Query:  INIDRIDRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRA--KEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKF
        + I R +      LL+ LE  S   CE CL GK  + PF     +   K PL ++HSD+ GP+         YF+ F+D ++ Y   YL+ +KS+    F
Subjt:  INIDRIDRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRA--KEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKF

Query:  KEFKAEVENLLGKSIKVLRSDRG
        ++F A+ E      +  L  D G
Subjt:  KEFKAEVENLLGKSIKVLRSDRG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-1835.71Show/hide
Query:  KLGYINIDRIDRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALE
        ++G+++   +  L K  L+S  +  ++ PC+ CL GK  +  F     R    L+L++SD+ GPM +++ GG +YF++FIDD SR  ++Y++  K +  +
Subjt:  KLGYINIDRIDRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALE

Query:  KFKEFKAEVENLLGKSIKVLRSDRGG
         F++F A VE   G+ +K LRSD GG
Subjt:  KFKEFKAEVENLLGKSIKVLRSDRGG

Q12491 Transposon Ty2-B Gag-Pol polyprotein3.9e-0928.99Show/hide
Query:  LGYINIDRIDRLVKNGLLSKLEDDSLP-------PCESCLEGKMTKRPFTGKGYRAK-----EPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYL
        LG+ N   I + +K   ++ L++  +         C  CL GK TK     KG R K     EP + +H+D++GP++   +    YFISF D+ +R+ ++
Subjt:  LGYINIDRIDRLVKNGLLSKLEDDSLP-------PCESCLEGKMTKRPFTGKGYRAK-----EPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYL

Query:  YLMGHKSE--ALEKFKEFKAEVENLLGKSIKVLRSDRG
        Y +  + E   L  F    A ++N     + V++ DRG
Subjt:  YLMGHKSE--ALEKFKEFKAEVENLLGKSIKVLRSDRG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.2e-1032.03Show/hide
Query:  AKLGYINIDRIDRLVKNGLLSKLE-DDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEA
        A+LG+     ++ ++ N  LS L        C  CL  K  K PF+     +  PLE I+SD++    + +   Y Y++ F+D ++RY +LY +  KS+ 
Subjt:  AKLGYINIDRIDRLVKNGLLSKLE-DDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEA

Query:  LEKFKEFKAEVENLLGKSIKVLRSDRGG
         E F  FK  +EN     I    SD GG
Subjt:  LEKFKEFKAEVENLLGKSIKVLRSDRGG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.1e-1131.25Show/hide
Query:  AKLGYINIDRIDRLVKNGLLSKLE-DDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEA
        ++LG+ ++  ++ ++ N  L  L     L  C  C   K  K PF+     + +PLE I+SD++    + +   Y Y++ F+D ++RY +LY +  KS+ 
Subjt:  AKLGYINIDRIDRLVKNGLLSKLE-DDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEA

Query:  LEKFKEFKAEVENLLGKSIKVLRSDRGG
         + F  FK+ VEN     I  L SD GG
Subjt:  LEKFKEFKAEVENLLGKSIKVLRSDRGG

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein1.6e-0533.82Show/hide
Query:  AKLGYINIDRIDRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNV
        ++L +++   ++ LVK G L   +  SL  CE C+ GK  +  F+   +  K PL+ +HSDL+G  +V
Subjt:  AKLGYINIDRIDRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCCTATATTCACAACGTGAGATTCATGCTCGGCTTCGAGTCGCCTTAGGATCGCCTCCCTTCGAAAGTGTTTGCATGAATCAATATCAAGGTGAATGGGAAAAATG
TCTAGAAAGACAAAGCAATCGAGATTATTTCTTGCCTGAATTTCCATTAAACTATTCTGGAATAGCTTATGAGCCAGTCACTGTCGATAATTTTGAGCTGAAAGCTAGTC
TCATTCAAATGGTAAGAGAGAATGCTTTCAAAGGCCATCCATCAGAAGATCCACACCATCACCTAAGATCATTTCTGAATATCTGTGGAACTGTAAAAATGGGAGGAGTT
AGCCCTGACTCAATTCGGTTCAGTTATTTCCATTTTCTTTACAAGACAAGGCTAGAGATTGGAAATCCAGCAAGAAATTTATCTAATGTCATCGCCATGGACACACAAGA
ACAATGCCTGGCCATAACTCGGAGAAGTGGAAAGCAGGAAGCAGAGAGAAAAGAAGTCAAACATACCAACACTAGGAGGATTCTGGATGAAGATGATGATCAAAACACAG
AAGAGGAGGAGAAAGCGCCGCCATCTCTACCTGAAAAGACGTCCCATGAGTTCGTGGACTGCTCCAGTGGATATTGCATGGTTAAAGATGCTAGTTCAGTTGTTGATGGT
GATGTGCTTGTTACGACACCCAATGAGGCATGGGAGATCAATGAAAATCTGACAGAAACTACTCAATTCCATGCCACTCCTCATGATGAGCCAGAATGCCAAGTTTCAGA
GCAGCAGCAGGAAGGCTTAGTGATTGCAGTACCTGATCCCCTTGCTATTAGGCTTAGATGCCCTACTAGTAATGCATCCCAAGAAGTTAAGGATGCTTATGACCGCTGGA
CAAAGGCCAATGATATGACTCGTGTCTATATCTTAGCCAGCTTATCTGAAGATTTGCCTAAAAGGCATGAGGGCATGATAGAAGGAGAGGCAAACGTTGTTCACTCTAAA
AGAAAGTTCGAGAAGGGTTCATCCTCTGGAACTAAATCTATAGCCACTTCTTCAAAGAAAACTCAGAAGAAGAAAGGAAACAAGGGGAAAGCTCCCAGTACTGCTGCTAA
AAGCAAGGGAAAAGCCAAAGCTATGGCAGATAAGGGCAAGTGTTTCCACTGCAATGTAGATGGACATTTGAAGAGAAACTGCCGAAAGTACCTTGCTGAGAAAAAAAGGA
AAAGAAGTTCATTTCGACAGTTGAACGATGGGGAAATGACACTCAGGGTTGGAACTGGAGAAGTCATTTCAGCTAAAGCAGTGGGAGCTGCGAAACTTGGTTATATAAAC
ATCGATAGGATCGATCGTTTGGTAAAGAATGGACTTCTAAGCAAGTTAGAAGATGATTCACTACCACCTTGTGAATCTTGTCTCGAGGGAAAAATGACCAAGAGACCCTT
TACTGGAAAAGGTTACAGAGCCAAAGAACCCTTAGAGTTAATACATTCGGATCTATATGGTCCAATGAATGTAAAAGCTCGAGGAGGGTACGAATATTTCATCTCATTTA
TAGATGATTATTCTAGATATGGTTACTTGTACCTAATGGGACATAAGTCTGAAGCCCTTGAAAAGTTTAAGGAGTTTAAGGCTGAAGTTGAAAACCTATTAGGAAAATCA
ATTAAAGTACTTCGATCTGATCGAGGAGGGAGTATATGGATCAAAGATTCCAGGATTATATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCCCTATATTCACAACGTGAGATTCATGCTCGGCTTCGAGTCGCCTTAGGATCGCCTCCCTTCGAAAGTGTTTGCATGAATCAATATCAAGGTGAATGGGAAAAATG
TCTAGAAAGACAAAGCAATCGAGATTATTTCTTGCCTGAATTTCCATTAAACTATTCTGGAATAGCTTATGAGCCAGTCACTGTCGATAATTTTGAGCTGAAAGCTAGTC
TCATTCAAATGGTAAGAGAGAATGCTTTCAAAGGCCATCCATCAGAAGATCCACACCATCACCTAAGATCATTTCTGAATATCTGTGGAACTGTAAAAATGGGAGGAGTT
AGCCCTGACTCAATTCGGTTCAGTTATTTCCATTTTCTTTACAAGACAAGGCTAGAGATTGGAAATCCAGCAAGAAATTTATCTAATGTCATCGCCATGGACACACAAGA
ACAATGCCTGGCCATAACTCGGAGAAGTGGAAAGCAGGAAGCAGAGAGAAAAGAAGTCAAACATACCAACACTAGGAGGATTCTGGATGAAGATGATGATCAAAACACAG
AAGAGGAGGAGAAAGCGCCGCCATCTCTACCTGAAAAGACGTCCCATGAGTTCGTGGACTGCTCCAGTGGATATTGCATGGTTAAAGATGCTAGTTCAGTTGTTGATGGT
GATGTGCTTGTTACGACACCCAATGAGGCATGGGAGATCAATGAAAATCTGACAGAAACTACTCAATTCCATGCCACTCCTCATGATGAGCCAGAATGCCAAGTTTCAGA
GCAGCAGCAGGAAGGCTTAGTGATTGCAGTACCTGATCCCCTTGCTATTAGGCTTAGATGCCCTACTAGTAATGCATCCCAAGAAGTTAAGGATGCTTATGACCGCTGGA
CAAAGGCCAATGATATGACTCGTGTCTATATCTTAGCCAGCTTATCTGAAGATTTGCCTAAAAGGCATGAGGGCATGATAGAAGGAGAGGCAAACGTTGTTCACTCTAAA
AGAAAGTTCGAGAAGGGTTCATCCTCTGGAACTAAATCTATAGCCACTTCTTCAAAGAAAACTCAGAAGAAGAAAGGAAACAAGGGGAAAGCTCCCAGTACTGCTGCTAA
AAGCAAGGGAAAAGCCAAAGCTATGGCAGATAAGGGCAAGTGTTTCCACTGCAATGTAGATGGACATTTGAAGAGAAACTGCCGAAAGTACCTTGCTGAGAAAAAAAGGA
AAAGAAGTTCATTTCGACAGTTGAACGATGGGGAAATGACACTCAGGGTTGGAACTGGAGAAGTCATTTCAGCTAAAGCAGTGGGAGCTGCGAAACTTGGTTATATAAAC
ATCGATAGGATCGATCGTTTGGTAAAGAATGGACTTCTAAGCAAGTTAGAAGATGATTCACTACCACCTTGTGAATCTTGTCTCGAGGGAAAAATGACCAAGAGACCCTT
TACTGGAAAAGGTTACAGAGCCAAAGAACCCTTAGAGTTAATACATTCGGATCTATATGGTCCAATGAATGTAAAAGCTCGAGGAGGGTACGAATATTTCATCTCATTTA
TAGATGATTATTCTAGATATGGTTACTTGTACCTAATGGGACATAAGTCTGAAGCCCTTGAAAAGTTTAAGGAGTTTAAGGCTGAAGTTGAAAACCTATTAGGAAAATCA
ATTAAAGTACTTCGATCTGATCGAGGAGGGAGTATATGGATCAAAGATTCCAGGATTATATGA
Protein sequenceShow/hide protein sequence
MSLYSQREIHARLRVALGSPPFESVCMNQYQGEWEKCLERQSNRDYFLPEFPLNYSGIAYEPVTVDNFELKASLIQMVRENAFKGHPSEDPHHHLRSFLNICGTVKMGGV
SPDSIRFSYFHFLYKTRLEIGNPARNLSNVIAMDTQEQCLAITRRSGKQEAERKEVKHTNTRRILDEDDDQNTEEEEKAPPSLPEKTSHEFVDCSSGYCMVKDASSVVDG
DVLVTTPNEAWEINENLTETTQFHATPHDEPECQVSEQQQEGLVIAVPDPLAIRLRCPTSNASQEVKDAYDRWTKANDMTRVYILASLSEDLPKRHEGMIEGEANVVHSK
RKFEKGSSSGTKSIATSSKKTQKKKGNKGKAPSTAAKSKGKAKAMADKGKCFHCNVDGHLKRNCRKYLAEKKRKRSSFRQLNDGEMTLRVGTGEVISAKAVGAAKLGYIN
IDRIDRLVKNGLLSKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLYGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALEKFKEFKAEVENLLGKS
IKVLRSDRGGSIWIKDSRII