; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0001611 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0001611
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionGag-pol polyprotein
Genome locationchr05:14644948..14647605
RNA-Seq ExpressionPay0001611
SyntenyPay0001611
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035966.1 F5J5.1 [Cucumis melo var. makuwa]7.5e-14550.07Show/hide
Query:  MITVNGVLVPNLEVDCIDAEEQSSVGNARALNVIFNGVDVNVFKLINSCSIAKEAWKTLEVAYEGTSKVKISRLQLITSKFGALRMTEDESVSDYNKRVL
        +I VNGV VP  EVD  +AEEQ+SVGN RALN IFNGVD+NVFKLINSCS AKEAWKTLEVAYEGTSKVKISRLQLITSKF ALRMTEDESVSDYNKRVL
Subjt:  MITVNGVLVPNLEVDCIDAEEQSSVGNARALNVIFNGVDVNVFKLINSCSIAKEAWKTLEVAYEGTSKVKISRLQLITSKFGALRMTEDESVSDYNKRVL

Query:  E---------------------------------------------------------------------------------------KSQMN-------
        +                                                                                       K+ M+       
Subjt:  E---------------------------------------------------------------------------------------KSQMN-------

Query:  ---------------------LYCSKKNFRVTLSDEESVDSRDDDGN-------------DNDNECSVESKNDELSIEKLETLWKEDCEARAIQKERIQY
                             L   KKNFRVTLSDEESVDSRDDDGN             D+++ECSVESKNDELSIEKL+TLWKEDCEAR IQKE IQ 
Subjt:  ---------------------LYCSKKNFRVTLSDEESVDSRDDDGN-------------DNDNECSVESKNDELSIEKLETLWKEDCEARAIQKERIQY

Query:  LLEENEWLMSVISSLKLKLREVHNENDQILKSVKMLNSRMDNLDSILKARHNGSHRYGLGFVASASSSKATSEIKFVPALMRVKYDTIHLETGIRTPIKS
        LLEENEWLMSVISSLKLKLREV NENDQILKSVKMLNS   NLDSILK  HNGS RYGLGFV+SASSSKATSEIKFVPA MRV+YDTIHLETGIR  +KS
Subjt:  LLEENEWLMSVISSLKLKLREVHNENDQILKSVKMLNSRMDNLDSILKARHNGSHRYGLGFVASASSSKATSEIKFVPALMRVKYDTIHLETGIRTPIKS

Query:  LGRTCYYCGRKGHI---------------------------------------------------------------------------------SDGAK
        LGRT YYCG+KGHI                                                                                  DGAK
Subjt:  LGRTCYYCGRKGHI---------------------------------------------------------------------------------SDGAK

Query:  GKIIAKGNIDKDDLPRLNDVRYVDGLNANLISISQLCDHGYKVSFDDV--------------------------------VDKIRS---WPWHRKLGHVS
        GKIIAKGNI+KDDLPRLNDVRYVDGL ANLISI+QLCD GYKVSFDD+                                 + IRS   W WHRKL H S
Subjt:  GKIIAKGNIDKDDLPRLNDVRYVDGLNANLISISQLCDHGYKVSFDDV--------------------------------VDKIRS---WPWHRKLGHVS

Query:  MRVLEKIIKNEVIVGIPNLDVNGNFFCGDYQIGKQ---------TRSTHKTSVKG--------------------KKITRIRSDH
        MR LEK+IKN+ +VGIP+LDVNGNFFCGD QIGK+         TR T    +KG                    KKITRIRSDH
Subjt:  MRVLEKIIKNEVIVGIPNLDVNGNFFCGDYQIGKQ---------TRSTHKTSVKG--------------------KKITRIRSDH

KAA0042995.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.4e-19953.68Show/hide
Query:  MITVNGVLVPNLEVDCIDAEEQSSVGNARALNVIFNGVDVNVFKLINSCSIAKEAWKTLEVAYEGTSKVKISRLQLITSKFGALRMTEDESVSDYNKRVL
        MI VNGV +P  EVD  D EEQ+SVGNARALN IFNGVD+NVFKLIN CS AKEAWKTLEVAYEGTSKVKISRLQL TSKF ALRMTEDESVSDYNK VL
Subjt:  MITVNGVLVPNLEVDCIDAEEQSSVGNARALNVIFNGVDVNVFKLINSCSIAKEAWKTLEVAYEGTSKVKISRLQLITSKFGALRMTEDESVSDYNKRVL

Query:  EKSQMNLYC---------------------------------------------------------------------------------SKKNFRVTLS
        E +  +L                                                                                    KKNFRVTLS
Subjt:  EKSQMNLYC---------------------------------------------------------------------------------SKKNFRVTLS

Query:  DEESVDSRDDDGN-------------DNDNECSVESKNDELSIEKLETLWKEDCEARAIQKERIQYLLEENEWLMSVISSLKLKLREVHNENDQILKSVK
        D+E VDSRDDDGN             D+D+ECS+ESKNDELSIEKLETLWKEDCEARAIQKERIQ LLEENE LMS                        
Subjt:  DEESVDSRDDDGN-------------DNDNECSVESKNDELSIEKLETLWKEDCEARAIQKERIQYLLEENEWLMSVISSLKLKLREVHNENDQILKSVK

Query:  MLNSRMDNLDSILKARHNGSHRYGLGFVASASSSKATSEIKFVPALMRVKYDTIHLETGIRTPIKSLGRTCYYCGRKGHI--------------------
               NLDSILKA HNGSHRYGLGFVASASSSKATSEIKFVPA MRV+YDTIH +TGIRTP+KSLGRTCYYCGRKGHI                    
Subjt:  MLNSRMDNLDSILKARHNGSHRYGLGFVASASSSKATSEIKFVPALMRVKYDTIHLETGIRTPIKSLGRTCYYCGRKGHI--------------------

Query:  -------------------------------------------------------------SDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLNANLISIS
                                                                      DGAKGKIIAKGNIDKDDL RLNDVRYVDGL ANLI+IS
Subjt:  -------------------------------------------------------------SDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLNANLISIS

Query:  QLCDHGYKVSFDDV-----------------------------------VDKIRSWPWHRKLGHVSMRVLEKIIKNEVIVGIPNLDVNGNFFCGDYQIGK
        QLCD GYKVSFDD+                                   +   ++W WHRKLGHVSMR LEK+IKN+ +VGIPNLDVNGNFFC D QIGK
Subjt:  QLCDHGYKVSFDDV-----------------------------------VDKIRSWPWHRKLGHVSMRVLEKIIKNEVIVGIPNLDVNGNFFCGDYQIGK

Query:  QTRSTHKT------------------------SVKG--------------------KKITRIRSDHGNESDTEDFNSFCLSEGIHHEFFAPITPQQNGVV
        QTRSTHK+                        S+ G                    KKITRIRSDHG E D E FNSFCL EG HHEF APITPQQNGVV
Subjt:  QTRSTHKT------------------------SVKG--------------------KKITRIRSDHGNESDTEDFNSFCLSEGIHHEFFAPITPQQNGVV

Query:  ERKNRTLREMDRVMIHAKNLPLCFWAEAVNIACHIHKRVTIRTGTTVTLYELWEETKPNVKYFHVFGSTCYILADREYHQKWDARSKQ
        ERKN+TL+EM RVMIHAKNLPLCF+AEAVN ACHIH RVTIRTGTT+TLYE W+E K NVKYFHVFGSTCYILADREYH+KWDARS+Q
Subjt:  ERKNRTLREMDRVMIHAKNLPLCFWAEAVNIACHIHKRVTIRTGTTVTLYELWEETKPNVKYFHVFGSTCYILADREYHQKWDARSKQ

KAA0053200.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.3e-14959.26Show/hide
Query:  MITVNGVLVPNLEVDCIDAEEQSSVGNARALNVIFNGVDVNVFKLINSCSIAKEAWKTLEVAYEGTSKVKISRLQLITSKFGALRMTEDESVSDYNKRVL
        MITVNG+ VP  EVD IDAEEQ+SVGNAR LN IFNGVD+NVFKLINSCS AKEAWKTL+V YEGTSKVKI+RLQLIT KF ALRM E+ESVSDYNKRVL
Subjt:  MITVNGVLVPNLEVDCIDAEEQSSVGNARALNVIFNGVDVNVFKLINSCSIAKEAWKTLEVAYEGTSKVKISRLQLITSKFGALRMTEDESVSDYNKRVL

Query:  EKSQMNLYC-------------------------------------------------------------------------------------------
        E +  +L                                                                                             
Subjt:  EKSQMNLYC-------------------------------------------------------------------------------------------

Query:  ---------------------------------------SKKNFRVTLSDEESVDSRDDDGN-------------DNDNECSVESKNDELSIEKLETLWK
                                                KKNF VTLSDEE VDSRDD GN             D+D+ECSVESKNDEL IEKLETLWK
Subjt:  ---------------------------------------SKKNFRVTLSDEESVDSRDDDGN-------------DNDNECSVESKNDELSIEKLETLWK

Query:  EDCEARAIQKERIQYLLEENEWLMSVISSLKLKLREVHNENDQILKSVKMLNSRMDNLDSILKARHNGSHRYGLGFVASASSSKATSEIKFVPALMRVKY
        EDCEAR IQKERIQ LLEENE LMSVISSLKLKLREV NENDQILKS KMLNS  +NLDSILKA HNGSHR+GLGFVASASSSKATSEIKFVPA MRV+Y
Subjt:  EDCEARAIQKERIQYLLEENEWLMSVISSLKLKLREVHNENDQILKSVKMLNSRMDNLDSILKARHNGSHRYGLGFVASASSSKATSEIKFVPALMRVKY

Query:  DTIHLETGIRTPIKSLGRTCYYCGRKGHISDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLNANLISISQLCDHGYKVSFDDVVDKIRS---WPWHRKLGH
        DTIH+ETGIRTPIKS GRTCYYCGRKGHI      KIIAK NID DDLPRLNDVRYVDGL ANLIS+SQLCD GYKVSFDD+   IRS   W WHRKLGH
Subjt:  DTIHLETGIRTPIKSLGRTCYYCGRKGHISDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLNANLISISQLCDHGYKVSFDDVVDKIRS---WPWHRKLGH

Query:  VSMRVLEKIIKNEVIVGIPNLDVNGNFFCGDYQIGKQTRS
        VS+R L K+IKN+ +VGIP+LDVNGNFF GD QIGK+  S
Subjt:  VSMRVLEKIIKNEVIVGIPNLDVNGNFFCGDYQIGKQTRS

TYK00141.1 gag-pol polyprotein [Cucumis melo var. makuwa]6.5e-14959.07Show/hide
Query:  MITVNGVLVPNLEVDCIDAEEQSSVGNARALNVIFNGVDVNVFKLINSCSIAKEAWKTLEVAYEGTSKVKISRLQLITSKFGALRMTEDESVSDYNKRVL
        MITVNG+ VP  EVD IDAEEQ+SVGNAR LN IFNGVD+NVFKLINSCS AKEAWKTL+V YEGTSKVKI+RLQLIT KF ALRM E+ESVSDYNKRVL
Subjt:  MITVNGVLVPNLEVDCIDAEEQSSVGNARALNVIFNGVDVNVFKLINSCSIAKEAWKTLEVAYEGTSKVKISRLQLITSKFGALRMTEDESVSDYNKRVL

Query:  EKSQMNLYC-------------------------------------------------------------------------------------------
        E +  +L                                                                                             
Subjt:  EKSQMNLYC-------------------------------------------------------------------------------------------

Query:  ---------------------------------------SKKNFRVTLSDEESVDSRDDDGN-------------DNDNECSVESKNDELSIEKLETLWK
                                                KKNF VTLSDEE VDSRDD GN             D+D+ECSVESKNDEL IEKLETLWK
Subjt:  ---------------------------------------SKKNFRVTLSDEESVDSRDDDGN-------------DNDNECSVESKNDELSIEKLETLWK

Query:  EDCEARAIQKERIQYLLEENEWLMSVISSLKLKLREVHNENDQILKSVKMLNSRMDNLDSILKARHNGSHRYGLGFVASASSSKATSEIKFVPALMRVKY
        EDCEAR IQKERIQ LLEENE LMSVISSLKLKLREV NENDQILKS KMLNS  +NLDSILKA HNGS+R+GLGFVASASSSKATSEIKFVPA MRV+Y
Subjt:  EDCEARAIQKERIQYLLEENEWLMSVISSLKLKLREVHNENDQILKSVKMLNSRMDNLDSILKARHNGSHRYGLGFVASASSSKATSEIKFVPALMRVKY

Query:  DTIHLETGIRTPIKSLGRTCYYCGRKGHISDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLNANLISISQLCDHGYKVSFDDVVDKIRS---WPWHRKLGH
        DTIH+ETGIRTPIKS GRTCYYCGRKGHI      KIIAK NID DDLPRLNDVRYVDGL ANLIS+SQLCD GYKVSFDD+   IRS   W WHRKLGH
Subjt:  DTIHLETGIRTPIKSLGRTCYYCGRKGHISDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLNANLISISQLCDHGYKVSFDDVVDKIRS---WPWHRKLGH

Query:  VSMRVLEKIIKNEVIVGIPNLDVNGNFFCGDYQIGKQTRS
        VS+R L K+IKN+ +VGIP+LDVNGNFF GD QIGK+  S
Subjt:  VSMRVLEKIIKNEVIVGIPNLDVNGNFFCGDYQIGKQTRS

XP_016902072.1 PREDICTED: uncharacterized protein LOC107991529 [Cucumis melo]2.3e-29596.07Show/hide
Query:  MITVNGVLVPNLEVDCIDAEEQSSVGNARALNVIFNGVDVNVFKLINSCSIAKEAWKTLEVAYEGTSKVKISRLQLITSKFGALRMTEDESVSDYNKRVL
        MITVNGVLVPNLEVDCIDAEEQSSVGNARALNVIFNGVDVNVFKLINSCSIAKEAWKTLEVAYEGTSKVKISRLQLITSKFGALRMTEDESVSDYNKRVL
Subjt:  MITVNGVLVPNLEVDCIDAEEQSSVGNARALNVIFNGVDVNVFKLINSCSIAKEAWKTLEVAYEGTSKVKISRLQLITSKFGALRMTEDESVSDYNKRVL

Query:  EKSQMNLYCSKKNFRVTLSDEESVDSRDDDGNDNDNECSVESKNDELSIEKLETLWKEDCEARAIQKERIQYLLEENEWLMSVISSLKLKLREVHNENDQ
        EKSQMNLYCSKKNFRVTLSDEESVDSRDDDGNDNDNECSVESKNDELSIEKLETLWKEDCEARAIQKERIQYLLEENEWLMSVISSLKLKLREVHNENDQ
Subjt:  EKSQMNLYCSKKNFRVTLSDEESVDSRDDDGNDNDNECSVESKNDELSIEKLETLWKEDCEARAIQKERIQYLLEENEWLMSVISSLKLKLREVHNENDQ

Query:  ILKSVKMLNSRMDNLDSILKARHNGSHRYGLGFVASASSSKATSEIKFVPALMRVKYDTIHLETGIRTPIKSLGRTCYYCGRKGHISDGAKGKIIAKGNI
        ILKSVKMLNSRMDNLDSILKARHNGSHRYGLGFVASASSSKATSEIKFVPALMRVKYDTIHLETGIRTPIKSLGRTCYYCGRKGHISDGAKGKIIAKGNI
Subjt:  ILKSVKMLNSRMDNLDSILKARHNGSHRYGLGFVASASSSKATSEIKFVPALMRVKYDTIHLETGIRTPIKSLGRTCYYCGRKGHISDGAKGKIIAKGNI

Query:  DKDDLPRLNDVRYVDGLNANLISISQLCDHGYKVSFDDVVDKIRSWPWHRKLGHVSMRVLEKIIKNEVIVGIPNLDVNGNFFCGDYQIGKQTRSTHKTSV
        DKDDLPRLNDV+YVDGLNANLISISQLCDHGYKV+FDDVVDKIRSWPWHRKLGHVSMR  EK+IKN+ +VGIP+L+VNGNFF GDYQI          SV
Subjt:  DKDDLPRLNDVRYVDGLNANLISISQLCDHGYKVSFDDVVDKIRSWPWHRKLGHVSMRVLEKIIKNEVIVGIPNLDVNGNFFCGDYQIGKQTRSTHKTSV

Query:  KGKKITRIRSDHGNESDTEDFNSFCLSEGIHHEFFAPITPQQNGVVERKNRTLREMDRVMIHAKNLPLCFWAEAVNIACHIHKRVTIRTGTTVTLYELWE
        KGKKITRIRSDHGNESDTEDFNSFCLSEGIHHEFFAPITPQQNGVVERKNRTLREMDRVMIHAKNLPLCFWAEAVNIACHIHKRVTIRTGTTVTLYELWE
Subjt:  KGKKITRIRSDHGNESDTEDFNSFCLSEGIHHEFFAPITPQQNGVVERKNRTLREMDRVMIHAKNLPLCFWAEAVNIACHIHKRVTIRTGTTVTLYELWE

Query:  ETKPNVKYFHVFGSTCYILADREYHQKWDARSKQ
        ETKPNVKYFHVFGSTCYILADREYHQKWDARSKQ
Subjt:  ETKPNVKYFHVFGSTCYILADREYHQKWDARSKQ

TrEMBL top hitse value%identityAlignment
A0A1S4E1H2 uncharacterized protein LOC1079915291.1e-29596.07Show/hide
Query:  MITVNGVLVPNLEVDCIDAEEQSSVGNARALNVIFNGVDVNVFKLINSCSIAKEAWKTLEVAYEGTSKVKISRLQLITSKFGALRMTEDESVSDYNKRVL
        MITVNGVLVPNLEVDCIDAEEQSSVGNARALNVIFNGVDVNVFKLINSCSIAKEAWKTLEVAYEGTSKVKISRLQLITSKFGALRMTEDESVSDYNKRVL
Subjt:  MITVNGVLVPNLEVDCIDAEEQSSVGNARALNVIFNGVDVNVFKLINSCSIAKEAWKTLEVAYEGTSKVKISRLQLITSKFGALRMTEDESVSDYNKRVL

Query:  EKSQMNLYCSKKNFRVTLSDEESVDSRDDDGNDNDNECSVESKNDELSIEKLETLWKEDCEARAIQKERIQYLLEENEWLMSVISSLKLKLREVHNENDQ
        EKSQMNLYCSKKNFRVTLSDEESVDSRDDDGNDNDNECSVESKNDELSIEKLETLWKEDCEARAIQKERIQYLLEENEWLMSVISSLKLKLREVHNENDQ
Subjt:  EKSQMNLYCSKKNFRVTLSDEESVDSRDDDGNDNDNECSVESKNDELSIEKLETLWKEDCEARAIQKERIQYLLEENEWLMSVISSLKLKLREVHNENDQ

Query:  ILKSVKMLNSRMDNLDSILKARHNGSHRYGLGFVASASSSKATSEIKFVPALMRVKYDTIHLETGIRTPIKSLGRTCYYCGRKGHISDGAKGKIIAKGNI
        ILKSVKMLNSRMDNLDSILKARHNGSHRYGLGFVASASSSKATSEIKFVPALMRVKYDTIHLETGIRTPIKSLGRTCYYCGRKGHISDGAKGKIIAKGNI
Subjt:  ILKSVKMLNSRMDNLDSILKARHNGSHRYGLGFVASASSSKATSEIKFVPALMRVKYDTIHLETGIRTPIKSLGRTCYYCGRKGHISDGAKGKIIAKGNI

Query:  DKDDLPRLNDVRYVDGLNANLISISQLCDHGYKVSFDDVVDKIRSWPWHRKLGHVSMRVLEKIIKNEVIVGIPNLDVNGNFFCGDYQIGKQTRSTHKTSV
        DKDDLPRLNDV+YVDGLNANLISISQLCDHGYKV+FDDVVDKIRSWPWHRKLGHVSMR  EK+IKN+ +VGIP+L+VNGNFF GDYQI          SV
Subjt:  DKDDLPRLNDVRYVDGLNANLISISQLCDHGYKVSFDDVVDKIRSWPWHRKLGHVSMRVLEKIIKNEVIVGIPNLDVNGNFFCGDYQIGKQTRSTHKTSV

Query:  KGKKITRIRSDHGNESDTEDFNSFCLSEGIHHEFFAPITPQQNGVVERKNRTLREMDRVMIHAKNLPLCFWAEAVNIACHIHKRVTIRTGTTVTLYELWE
        KGKKITRIRSDHGNESDTEDFNSFCLSEGIHHEFFAPITPQQNGVVERKNRTLREMDRVMIHAKNLPLCFWAEAVNIACHIHKRVTIRTGTTVTLYELWE
Subjt:  KGKKITRIRSDHGNESDTEDFNSFCLSEGIHHEFFAPITPQQNGVVERKNRTLREMDRVMIHAKNLPLCFWAEAVNIACHIHKRVTIRTGTTVTLYELWE

Query:  ETKPNVKYFHVFGSTCYILADREYHQKWDARSKQ
        ETKPNVKYFHVFGSTCYILADREYHQKWDARSKQ
Subjt:  ETKPNVKYFHVFGSTCYILADREYHQKWDARSKQ

A0A5A7TNK7 Gag-pol polyprotein6.7e-20053.68Show/hide
Query:  MITVNGVLVPNLEVDCIDAEEQSSVGNARALNVIFNGVDVNVFKLINSCSIAKEAWKTLEVAYEGTSKVKISRLQLITSKFGALRMTEDESVSDYNKRVL
        MI VNGV +P  EVD  D EEQ+SVGNARALN IFNGVD+NVFKLIN CS AKEAWKTLEVAYEGTSKVKISRLQL TSKF ALRMTEDESVSDYNK VL
Subjt:  MITVNGVLVPNLEVDCIDAEEQSSVGNARALNVIFNGVDVNVFKLINSCSIAKEAWKTLEVAYEGTSKVKISRLQLITSKFGALRMTEDESVSDYNKRVL

Query:  EKSQMNLYC---------------------------------------------------------------------------------SKKNFRVTLS
        E +  +L                                                                                    KKNFRVTLS
Subjt:  EKSQMNLYC---------------------------------------------------------------------------------SKKNFRVTLS

Query:  DEESVDSRDDDGN-------------DNDNECSVESKNDELSIEKLETLWKEDCEARAIQKERIQYLLEENEWLMSVISSLKLKLREVHNENDQILKSVK
        D+E VDSRDDDGN             D+D+ECS+ESKNDELSIEKLETLWKEDCEARAIQKERIQ LLEENE LMS                        
Subjt:  DEESVDSRDDDGN-------------DNDNECSVESKNDELSIEKLETLWKEDCEARAIQKERIQYLLEENEWLMSVISSLKLKLREVHNENDQILKSVK

Query:  MLNSRMDNLDSILKARHNGSHRYGLGFVASASSSKATSEIKFVPALMRVKYDTIHLETGIRTPIKSLGRTCYYCGRKGHI--------------------
               NLDSILKA HNGSHRYGLGFVASASSSKATSEIKFVPA MRV+YDTIH +TGIRTP+KSLGRTCYYCGRKGHI                    
Subjt:  MLNSRMDNLDSILKARHNGSHRYGLGFVASASSSKATSEIKFVPALMRVKYDTIHLETGIRTPIKSLGRTCYYCGRKGHI--------------------

Query:  -------------------------------------------------------------SDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLNANLISIS
                                                                      DGAKGKIIAKGNIDKDDL RLNDVRYVDGL ANLI+IS
Subjt:  -------------------------------------------------------------SDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLNANLISIS

Query:  QLCDHGYKVSFDDV-----------------------------------VDKIRSWPWHRKLGHVSMRVLEKIIKNEVIVGIPNLDVNGNFFCGDYQIGK
        QLCD GYKVSFDD+                                   +   ++W WHRKLGHVSMR LEK+IKN+ +VGIPNLDVNGNFFC D QIGK
Subjt:  QLCDHGYKVSFDDV-----------------------------------VDKIRSWPWHRKLGHVSMRVLEKIIKNEVIVGIPNLDVNGNFFCGDYQIGK

Query:  QTRSTHKT------------------------SVKG--------------------KKITRIRSDHGNESDTEDFNSFCLSEGIHHEFFAPITPQQNGVV
        QTRSTHK+                        S+ G                    KKITRIRSDHG E D E FNSFCL EG HHEF APITPQQNGVV
Subjt:  QTRSTHKT------------------------SVKG--------------------KKITRIRSDHGNESDTEDFNSFCLSEGIHHEFFAPITPQQNGVV

Query:  ERKNRTLREMDRVMIHAKNLPLCFWAEAVNIACHIHKRVTIRTGTTVTLYELWEETKPNVKYFHVFGSTCYILADREYHQKWDARSKQ
        ERKN+TL+EM RVMIHAKNLPLCF+AEAVN ACHIH RVTIRTGTT+TLYE W+E K NVKYFHVFGSTCYILADREYH+KWDARS+Q
Subjt:  ERKNRTLREMDRVMIHAKNLPLCFWAEAVNIACHIHKRVTIRTGTTVTLYELWEETKPNVKYFHVFGSTCYILADREYHQKWDARSKQ

A0A5A7UDB5 Gag-pol polyprotein6.4e-15059.26Show/hide
Query:  MITVNGVLVPNLEVDCIDAEEQSSVGNARALNVIFNGVDVNVFKLINSCSIAKEAWKTLEVAYEGTSKVKISRLQLITSKFGALRMTEDESVSDYNKRVL
        MITVNG+ VP  EVD IDAEEQ+SVGNAR LN IFNGVD+NVFKLINSCS AKEAWKTL+V YEGTSKVKI+RLQLIT KF ALRM E+ESVSDYNKRVL
Subjt:  MITVNGVLVPNLEVDCIDAEEQSSVGNARALNVIFNGVDVNVFKLINSCSIAKEAWKTLEVAYEGTSKVKISRLQLITSKFGALRMTEDESVSDYNKRVL

Query:  EKSQMNLYC-------------------------------------------------------------------------------------------
        E +  +L                                                                                             
Subjt:  EKSQMNLYC-------------------------------------------------------------------------------------------

Query:  ---------------------------------------SKKNFRVTLSDEESVDSRDDDGN-------------DNDNECSVESKNDELSIEKLETLWK
                                                KKNF VTLSDEE VDSRDD GN             D+D+ECSVESKNDEL IEKLETLWK
Subjt:  ---------------------------------------SKKNFRVTLSDEESVDSRDDDGN-------------DNDNECSVESKNDELSIEKLETLWK

Query:  EDCEARAIQKERIQYLLEENEWLMSVISSLKLKLREVHNENDQILKSVKMLNSRMDNLDSILKARHNGSHRYGLGFVASASSSKATSEIKFVPALMRVKY
        EDCEAR IQKERIQ LLEENE LMSVISSLKLKLREV NENDQILKS KMLNS  +NLDSILKA HNGSHR+GLGFVASASSSKATSEIKFVPA MRV+Y
Subjt:  EDCEARAIQKERIQYLLEENEWLMSVISSLKLKLREVHNENDQILKSVKMLNSRMDNLDSILKARHNGSHRYGLGFVASASSSKATSEIKFVPALMRVKY

Query:  DTIHLETGIRTPIKSLGRTCYYCGRKGHISDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLNANLISISQLCDHGYKVSFDDVVDKIRS---WPWHRKLGH
        DTIH+ETGIRTPIKS GRTCYYCGRKGHI      KIIAK NID DDLPRLNDVRYVDGL ANLIS+SQLCD GYKVSFDD+   IRS   W WHRKLGH
Subjt:  DTIHLETGIRTPIKSLGRTCYYCGRKGHISDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLNANLISISQLCDHGYKVSFDDVVDKIRS---WPWHRKLGH

Query:  VSMRVLEKIIKNEVIVGIPNLDVNGNFFCGDYQIGKQTRS
        VS+R L K+IKN+ +VGIP+LDVNGNFF GD QIGK+  S
Subjt:  VSMRVLEKIIKNEVIVGIPNLDVNGNFFCGDYQIGKQTRS

A0A5D3BJZ7 Gag-pol polyprotein3.2e-14959.07Show/hide
Query:  MITVNGVLVPNLEVDCIDAEEQSSVGNARALNVIFNGVDVNVFKLINSCSIAKEAWKTLEVAYEGTSKVKISRLQLITSKFGALRMTEDESVSDYNKRVL
        MITVNG+ VP  EVD IDAEEQ+SVGNAR LN IFNGVD+NVFKLINSCS AKEAWKTL+V YEGTSKVKI+RLQLIT KF ALRM E+ESVSDYNKRVL
Subjt:  MITVNGVLVPNLEVDCIDAEEQSSVGNARALNVIFNGVDVNVFKLINSCSIAKEAWKTLEVAYEGTSKVKISRLQLITSKFGALRMTEDESVSDYNKRVL

Query:  EKSQMNLYC-------------------------------------------------------------------------------------------
        E +  +L                                                                                             
Subjt:  EKSQMNLYC-------------------------------------------------------------------------------------------

Query:  ---------------------------------------SKKNFRVTLSDEESVDSRDDDGN-------------DNDNECSVESKNDELSIEKLETLWK
                                                KKNF VTLSDEE VDSRDD GN             D+D+ECSVESKNDEL IEKLETLWK
Subjt:  ---------------------------------------SKKNFRVTLSDEESVDSRDDDGN-------------DNDNECSVESKNDELSIEKLETLWK

Query:  EDCEARAIQKERIQYLLEENEWLMSVISSLKLKLREVHNENDQILKSVKMLNSRMDNLDSILKARHNGSHRYGLGFVASASSSKATSEIKFVPALMRVKY
        EDCEAR IQKERIQ LLEENE LMSVISSLKLKLREV NENDQILKS KMLNS  +NLDSILKA HNGS+R+GLGFVASASSSKATSEIKFVPA MRV+Y
Subjt:  EDCEARAIQKERIQYLLEENEWLMSVISSLKLKLREVHNENDQILKSVKMLNSRMDNLDSILKARHNGSHRYGLGFVASASSSKATSEIKFVPALMRVKY

Query:  DTIHLETGIRTPIKSLGRTCYYCGRKGHISDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLNANLISISQLCDHGYKVSFDDVVDKIRS---WPWHRKLGH
        DTIH+ETGIRTPIKS GRTCYYCGRKGHI      KIIAK NID DDLPRLNDVRYVDGL ANLIS+SQLCD GYKVSFDD+   IRS   W WHRKLGH
Subjt:  DTIHLETGIRTPIKSLGRTCYYCGRKGHISDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLNANLISISQLCDHGYKVSFDDVVDKIRS---WPWHRKLGH

Query:  VSMRVLEKIIKNEVIVGIPNLDVNGNFFCGDYQIGKQTRS
        VS+R L K+IKN+ +VGIP+LDVNGNFF GD QIGK+  S
Subjt:  VSMRVLEKIIKNEVIVGIPNLDVNGNFFCGDYQIGKQTRS

A0A5D3E2Y4 F5J5.13.6e-14550.07Show/hide
Query:  MITVNGVLVPNLEVDCIDAEEQSSVGNARALNVIFNGVDVNVFKLINSCSIAKEAWKTLEVAYEGTSKVKISRLQLITSKFGALRMTEDESVSDYNKRVL
        +I VNGV VP  EVD  +AEEQ+SVGN RALN IFNGVD+NVFKLINSCS AKEAWKTLEVAYEGTSKVKISRLQLITSKF ALRMTEDESVSDYNKRVL
Subjt:  MITVNGVLVPNLEVDCIDAEEQSSVGNARALNVIFNGVDVNVFKLINSCSIAKEAWKTLEVAYEGTSKVKISRLQLITSKFGALRMTEDESVSDYNKRVL

Query:  E---------------------------------------------------------------------------------------KSQMN-------
        +                                                                                       K+ M+       
Subjt:  E---------------------------------------------------------------------------------------KSQMN-------

Query:  ---------------------LYCSKKNFRVTLSDEESVDSRDDDGN-------------DNDNECSVESKNDELSIEKLETLWKEDCEARAIQKERIQY
                             L   KKNFRVTLSDEESVDSRDDDGN             D+++ECSVESKNDELSIEKL+TLWKEDCEAR IQKE IQ 
Subjt:  ---------------------LYCSKKNFRVTLSDEESVDSRDDDGN-------------DNDNECSVESKNDELSIEKLETLWKEDCEARAIQKERIQY

Query:  LLEENEWLMSVISSLKLKLREVHNENDQILKSVKMLNSRMDNLDSILKARHNGSHRYGLGFVASASSSKATSEIKFVPALMRVKYDTIHLETGIRTPIKS
        LLEENEWLMSVISSLKLKLREV NENDQILKSVKMLNS   NLDSILK  HNGS RYGLGFV+SASSSKATSEIKFVPA MRV+YDTIHLETGIR  +KS
Subjt:  LLEENEWLMSVISSLKLKLREVHNENDQILKSVKMLNSRMDNLDSILKARHNGSHRYGLGFVASASSSKATSEIKFVPALMRVKYDTIHLETGIRTPIKS

Query:  LGRTCYYCGRKGHI---------------------------------------------------------------------------------SDGAK
        LGRT YYCG+KGHI                                                                                  DGAK
Subjt:  LGRTCYYCGRKGHI---------------------------------------------------------------------------------SDGAK

Query:  GKIIAKGNIDKDDLPRLNDVRYVDGLNANLISISQLCDHGYKVSFDDV--------------------------------VDKIRS---WPWHRKLGHVS
        GKIIAKGNI+KDDLPRLNDVRYVDGL ANLISI+QLCD GYKVSFDD+                                 + IRS   W WHRKL H S
Subjt:  GKIIAKGNIDKDDLPRLNDVRYVDGLNANLISISQLCDHGYKVSFDDV--------------------------------VDKIRS---WPWHRKLGHVS

Query:  MRVLEKIIKNEVIVGIPNLDVNGNFFCGDYQIGKQ---------TRSTHKTSVKG--------------------KKITRIRSDH
        MR LEK+IKN+ +VGIP+LDVNGNFFCGD QIGK+         TR T    +KG                    KKITRIRSDH
Subjt:  MRVLEKIIKNEVIVGIPNLDVNGNFFCGDYQIGKQ---------TRSTHKTSVKG--------------------KKITRIRSDH

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.0e-1536.75Show/hide
Query:  KITRIRSDHGNESDTEDFNSFCLSEGIHHEFFAPITPQQNGVVERKNRTLREMDRVMIHAKNLPLCFWAEAVNIACHIHKRVTIR--TGTTVTLYELWEE
        K+  +  D+G E  + +   FC+ +GI +    P TPQ NGV ER  RT+ E  R M+    L   FW EAV  A ++  R+  R    ++ T YE+W  
Subjt:  KITRIRSDHGNESDTEDFNSFCLSEGIHHEFFAPITPQQNGVVERKNRTLREMDRVMIHAKNLPLCFWAEAVNIACHIHKRVTIR--TGTTVTLYELWEE

Query:  TKPNVKYFHVFGSTCYI
         KP +K+  VFG+T Y+
Subjt:  TKPNVKYFHVFGSTCYI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-1736.64Show/hide
Query:  GKKITRIRSDHGNESDTEDFNSFCLSEGIHHEFFAPITPQQNGVVERKNRTLREMDRVMIHAKNLPLCFWAEAVNIACHIHKRVTIRTGTTVTLYELWEE
        G+K+ R+RSD+G E  + +F  +C S GI HE   P TPQ NGV ER NRT+ E  R M+    LP  FW EAV  AC++  R             +W  
Subjt:  GKKITRIRSDHGNESDTEDFNSFCLSEGIHHEFFAPITPQQNGVVERKNRTLREMDRVMIHAKNLPLCFWAEAVNIACHIHKRVTIRTGTTVTLYELWEE

Query:  TKPNVKYFHVFGSTCYILADREYHQKWDARS
         + +  +  VFG   +    +E   K D +S
Subjt:  TKPNVKYFHVFGSTCYILADREYHQKWDARS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.9e-0629.53Show/hide
Query:  KQTRSTHKTSVKGKKITRI---RSDHGNESDTEDFNSFCLSEGIHHEFFAPITPQQNGVVERKNRTLREMDRVMIHAKNLPLCFWAEAVNIACHIHKRVT
        K+T  T K  ++ +  TRI    SD+G E        +    GI H    P TP+ NG+ ERK+R + E    ++   ++P  +W  A  +A ++  R+ 
Subjt:  KQTRSTHKTSVKGKKITRI---RSDHGNESDTEDFNSFCLSEGIHHEFFAPITPQQNGVVERKNRTLREMDRVMIHAKNLPLCFWAEAVNIACHIHKRVT

Query:  IRTGTTVTLYELWEETKPNVKYFHVFGSTCYILADREYHQ-KWDARSKQ
               + ++    T PN     VFG  CY    R Y+Q K D +S+Q
Subjt:  IRTGTTVTLYELWEETKPNVKYFHVFGSTCYILADREYHQ-KWDARSKQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.2e-0627.7Show/hide
Query:  KQTRSTHKTSVKGKKITRI---RSDHGNESDTEDFNSFCLSEGIHHEFFAPITPQQNGVVERKNRTLREMDRVMIHAKNLPLCFWAEAVNIACHIHKRVT
        K T    K+ V+ +  TRI    SD+G E        +    GI H    P TP+ NG+ ERK+R + EM   ++   ++P  +W  A ++A ++  R+ 
Subjt:  KQTRSTHKTSVKGKKITRI---RSDHGNESDTEDFNSFCLSEGIHHEFFAPITPQQNGVVERKNRTLREMDRVMIHAKNLPLCFWAEAVNIACHIHKRVT

Query:  IRTGTTVTLYELWEETKPNVKYFHVFGSTCYILADREYHQKWDARSKQ
               + ++      PN +   VFG  CY         K + +SKQ
Subjt:  IRTGTTVTLYELWEETKPNVKYFHVFGSTCYILADREYHQKWDARSKQ

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTACTGTGAATGGTGTCTTAGTTCCAAATCTTGAAGTTGATTGTATCGATGCTGAAGAGCAATCTTCTGTTGGGAATGCCAGAGCTCTTAATGTGATATTTAATGG
TGTTGACGTGAACGTTTTCAAGTTAATAAATTCCTGCAGTATAGCCAAAGAAGCTTGGAAAACCTTGGAGGTAGCATATGAAGGTACCTCCAAAGTAAAGATCTCAAGAC
TACAGCTGATAACATCTAAGTTTGGGGCATTGAGAATGACCGAGGATGAATCAGTGTCTGATTACAATAAGAGAGTTCTTGAAAAATCGCAAATGAATCTCTACTGTTCG
AAGAAAAACTTTCGTGTCACACTGTCAGATGAAGAATCTGTTGATAGTAGAGATGATGATGGCAACGATAATGATAATGAATGTTCGGTAGAAAGTAAAAATGATGAATT
GTCAATTGAGAAGCTCGAAACTCTATGGAAAGAAGATTGTGAAGCAAGGGCAATACAAAAGGAAAGGATCCAATATCTTTTAGAAGAAAATGAATGGTTGATGTCTGTAA
TATCTTCTCTAAAGTTAAAATTGAGAGAGGTTCATAATGAGAATGATCAAATTTTAAAATCCGTTAAAATGCTAAATTCAAGAATGGATAATCTAGATTCAATACTAAAA
GCTAGACATAATGGCTCTCATAGATATGGGTTGGGATTTGTGGCCTCTGCAAGTAGTTCTAAAGCTACATCAGAAATCAAATTTGTCCCTGCCTTAATGAGAGTTAAATA
TGACACGATTCATTTAGAGACTGGCATCAGGACTCCAATTAAATCTCTTGGAAGAACTTGTTACTATTGTGGTCGAAAAGGTCATATCAGTGATGGTGCAAAAGGAAAAA
TTATAGCTAAAGGTAACATAGACAAAGATGATCTACCACGACTGAATGATGTTAGGTATGTGGATGGACTAAATGCAAACTTGATCAGTATAAGTCAACTGTGTGATCAT
GGTTACAAAGTTAGTTTTGATGATGTTGTTGATAAGATCCGATCATGGCCATGGCACAGAAAGCTGGGGCATGTCAGCATGAGAGTCTTGGAAAAGATTATAAAAAATGA
AGTTATTGTAGGAATTCCTAATCTAGACGTAAATGGTAATTTCTTCTGTGGAGACTATCAAATTGGCAAGCAGACAAGGTCTACTCATAAAACTAGTGTGAAAGGGAAGA
AGATAACAAGAATCCGAAGTGATCATGGTAATGAATCCGATACTGAAGACTTTAACAGTTTTTGTCTGTCAGAAGGAATACACCATGAGTTTTTTGCACCAATAACTCCT
CAACAAAATGGTGTAGTAGAAAGAAAGAACAGGACATTACGAGAAATGGATCGTGTTATGATACATGCCAAAAATTTACCTCTATGTTTTTGGGCAGAAGCTGTAAATAT
TGCCTGTCACATTCATAAAAGAGTAACTATTAGAACTGGAACAACTGTTACTCTTTATGAACTTTGGGAAGAGACAAAACCAAATGTCAAATACTTCCATGTGTTTGGAA
GTACATGTTATATCTTAGCTGACCGAGAATACCATCAGAAATGGGATGCAAGGTCGAAACAATGA
mRNA sequenceShow/hide mRNA sequence
ATGATTACTGTGAATGGTGTCTTAGTTCCAAATCTTGAAGTTGATTGTATCGATGCTGAAGAGCAATCTTCTGTTGGGAATGCCAGAGCTCTTAATGTGATATTTAATGG
TGTTGACGTGAACGTTTTCAAGTTAATAAATTCCTGCAGTATAGCCAAAGAAGCTTGGAAAACCTTGGAGGTAGCATATGAAGGTACCTCCAAAGTAAAGATCTCAAGAC
TACAGCTGATAACATCTAAGTTTGGGGCATTGAGAATGACCGAGGATGAATCAGTGTCTGATTACAATAAGAGAGTTCTTGAAAAATCGCAAATGAATCTCTACTGTTCG
AAGAAAAACTTTCGTGTCACACTGTCAGATGAAGAATCTGTTGATAGTAGAGATGATGATGGCAACGATAATGATAATGAATGTTCGGTAGAAAGTAAAAATGATGAATT
GTCAATTGAGAAGCTCGAAACTCTATGGAAAGAAGATTGTGAAGCAAGGGCAATACAAAAGGAAAGGATCCAATATCTTTTAGAAGAAAATGAATGGTTGATGTCTGTAA
TATCTTCTCTAAAGTTAAAATTGAGAGAGGTTCATAATGAGAATGATCAAATTTTAAAATCCGTTAAAATGCTAAATTCAAGAATGGATAATCTAGATTCAATACTAAAA
GCTAGACATAATGGCTCTCATAGATATGGGTTGGGATTTGTGGCCTCTGCAAGTAGTTCTAAAGCTACATCAGAAATCAAATTTGTCCCTGCCTTAATGAGAGTTAAATA
TGACACGATTCATTTAGAGACTGGCATCAGGACTCCAATTAAATCTCTTGGAAGAACTTGTTACTATTGTGGTCGAAAAGGTCATATCAGTGATGGTGCAAAAGGAAAAA
TTATAGCTAAAGGTAACATAGACAAAGATGATCTACCACGACTGAATGATGTTAGGTATGTGGATGGACTAAATGCAAACTTGATCAGTATAAGTCAACTGTGTGATCAT
GGTTACAAAGTTAGTTTTGATGATGTTGTTGATAAGATCCGATCATGGCCATGGCACAGAAAGCTGGGGCATGTCAGCATGAGAGTCTTGGAAAAGATTATAAAAAATGA
AGTTATTGTAGGAATTCCTAATCTAGACGTAAATGGTAATTTCTTCTGTGGAGACTATCAAATTGGCAAGCAGACAAGGTCTACTCATAAAACTAGTGTGAAAGGGAAGA
AGATAACAAGAATCCGAAGTGATCATGGTAATGAATCCGATACTGAAGACTTTAACAGTTTTTGTCTGTCAGAAGGAATACACCATGAGTTTTTTGCACCAATAACTCCT
CAACAAAATGGTGTAGTAGAAAGAAAGAACAGGACATTACGAGAAATGGATCGTGTTATGATACATGCCAAAAATTTACCTCTATGTTTTTGGGCAGAAGCTGTAAATAT
TGCCTGTCACATTCATAAAAGAGTAACTATTAGAACTGGAACAACTGTTACTCTTTATGAACTTTGGGAAGAGACAAAACCAAATGTCAAATACTTCCATGTGTTTGGAA
GTACATGTTATATCTTAGCTGACCGAGAATACCATCAGAAATGGGATGCAAGGTCGAAACAATGA
Protein sequenceShow/hide protein sequence
MITVNGVLVPNLEVDCIDAEEQSSVGNARALNVIFNGVDVNVFKLINSCSIAKEAWKTLEVAYEGTSKVKISRLQLITSKFGALRMTEDESVSDYNKRVLEKSQMNLYCS
KKNFRVTLSDEESVDSRDDDGNDNDNECSVESKNDELSIEKLETLWKEDCEARAIQKERIQYLLEENEWLMSVISSLKLKLREVHNENDQILKSVKMLNSRMDNLDSILK
ARHNGSHRYGLGFVASASSSKATSEIKFVPALMRVKYDTIHLETGIRTPIKSLGRTCYYCGRKGHISDGAKGKIIAKGNIDKDDLPRLNDVRYVDGLNANLISISQLCDH
GYKVSFDDVVDKIRSWPWHRKLGHVSMRVLEKIIKNEVIVGIPNLDVNGNFFCGDYQIGKQTRSTHKTSVKGKKITRIRSDHGNESDTEDFNSFCLSEGIHHEFFAPITP
QQNGVVERKNRTLREMDRVMIHAKNLPLCFWAEAVNIACHIHKRVTIRTGTTVTLYELWEETKPNVKYFHVFGSTCYILADREYHQKWDARSKQ