; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0014476 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0014476
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionGag-pol polyprotein
Genome locationchr09:17746463..17748494
RNA-Seq ExpressionPay0014476
SyntenyPay0014476
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032462.1 gag-proteinase polyprotein [Cucumis melo var. makuwa]8.0e-15776.79Show/hide
Query:  MEIIREGPSASHPPVLDGKNYSYWKPHMIFFIKTLDGKAWRVLVGGYEPPMVTVNEVLVPKPEINWTDAEEQASVGNARAINAIFKGVDLNVFKLINSCT
        MEIIREGPSAS  PVLDGKNYSYWKP MIFFIKTLDGKAWR LVGGYEPPMVTVN V VPKPE++WTDAEEQASVGNARAINAIF  VDLN FKLINSCT
Subjt:  MEIIREGPSASHPPVLDGKNYSYWKPHMIFFIKTLDGKAWRVLVGGYEPPMVTVNEVLVPKPEINWTDAEEQASVGNARAINAIFKGVDLNVFKLINSCT

Query:  TAKEARKILEVAYEGTSKVKISRLQLITSKFEALKMTEDESVSEYNERVLEIANDSLLLGEKISESKIVCKVLRSLPRKLDMKVIAIEEAQDITTLKLDE
        TAKEA KILEVAYEGTSKVKISRLQLITSKFEA KM EDESVS+YNERVLEIANDSLLLGEKI ESKIV KVLRSLPRK DMKV AIEEA+DITT+KLDE
Subjt:  TAKEARKILEVAYEGTSKVKISRLQLITSKFEALKMTEDESVSEYNERVLEIANDSLLLGEKISESKIVCKVLRSLPRKLDMKVIAIEEAQDITTLKLDE

Query:  LFGSLLTFEMASG-----------------------NETNQDESIALLTKQFSKMARKFKSLNTAGKTKKTGRHDGENSRRKVNNFSYRRNSDHGKKKED
        LFGSLLTFEMA                         N+ NQDESIALLTKQFSKMARKFKSLNTAG+T+KTGRHDGENS RKVN+ SYRRN+DH KK ED
Subjt:  LFGSLLTFEMASG-----------------------NETNQDESIALLTKQFSKMARKFKSLNTAGKTKKTGRHDGENSRRKVNNFSYRRNSDHGKKKED

Query:  VGRSFRCKECEGFGHYQAECPTYLRRQKKNYCATLSDEDSDDDEDDHEINSEVDSECFNIDEDEELTLEELKMLRKEDSEARAIKKERIQDLMDENERLM
        VGRSFRCKECEGFGHYQAE    +      + A ++           EINSEVDSECF+ DEDEELTLE+LKMLRKEDSEARAI+KERIQ+LM+ENERL+
Subjt:  VGRSFRCKECEGFGHYQAECPTYLRRQKKNYCATLSDEDSDDDEDDHEINSEVDSECFNIDEDEELTLEELKMLRKEDSEARAIKKERIQDLMDENERLM

Query:  RVLNS
         +++S
Subjt:  RVLNS

KAA0060126.1 gag-pol polyprotein [Cucumis melo var. makuwa]5.2e-16474.83Show/hide
Query:  MVTVNEVLVPKPEINWTDAEEQASVGNARAINAIFKGVDLNVFKLINSCTTAKEARKILEVAYEGTSKVKISRLQLITSKFEALKMTEDESVSEYNERVL
        M+TVN V VPK EI WTDA+EQA VGNARAINAIF  VDLNVFKLINSCTTAKEA KILEVAYEGTSK KISRLQLITSKFEALKMTEDE VSEYNERVL
Subjt:  MVTVNEVLVPKPEINWTDAEEQASVGNARAINAIFKGVDLNVFKLINSCTTAKEARKILEVAYEGTSKVKISRLQLITSKFEALKMTEDESVSEYNERVL

Query:  EIANDSLLLGEKISESKIVCKVLRSLPRKLDMKVIAIEEAQDITTLKLDELFGSLLTFEMASGNETNQDESIALLTKQFSKMARKFKSLNTAGKTKKTGR
        +IAN+SLLLGEKI E KIV KVLRSLPRK DMKV A EEAQ+ITTLKLDELFGSLLTFEM S   ++++   A LTKQFSKMAR FK LNT GKT+KT R
Subjt:  EIANDSLLLGEKISESKIVCKVLRSLPRKLDMKVIAIEEAQDITTLKLDELFGSLLTFEMASGNETNQDESIALLTKQFSKMARKFKSLNTAGKTKKTGR

Query:  HDGENSRRKVNNFSYRRNSDHGKKKEDVGRSFRCKECEGFGHYQAECPTYLRRQKKNYCATLSDEDSDDDEDDH----------EINSEVDSECFNIDED
        HD ENS RKVN+ SYRRNSDHGKK EDVGRSFRC+EC+GFGHYQ +CPTYL+RQKKNYC+TLSDEDSDDDEDDH          +INSE DSECF+IDED
Subjt:  HDGENSRRKVNNFSYRRNSDHGKKKEDVGRSFRCKECEGFGHYQAECPTYLRRQKKNYCATLSDEDSDDDEDDH----------EINSEVDSECFNIDED

Query:  EELTLEELKMLRKEDSEARAIKKERIQDLMDENERLMRVLNS------PDGS-----------VIIVAEECNVAFTTVQTHVDAWYFDSACSRHMTGNRS
        EELTLEEL +LRKE SEARAI+KERIQDLMDENERLM +++S      P+             ++  +E+CNVAFTTVQTHVDAWYFDS  SRHMT NRS
Subjt:  EELTLEELKMLRKEDSEARAIKKERIQDLMDENERLMRVLNS------PDGS-----------VIIVAEECNVAFTTVQTHVDAWYFDSACSRHMTGNRS

Query:  FFTELEECASGHVTFGDGAKGKIIAKGNIDKSNLPCLNEVR
        FFTELEECASGHVTFGDGAKGKII K   DK+N   +++ R
Subjt:  FFTELEECASGHVTFGDGAKGKIIAKGNIDKSNLPCLNEVR

TYK22564.1 gag-pol polyprotein [Cucumis melo var. makuwa]2.6e-17183.72Show/hide
Query:  MEIIREGPSASHPPVLDGKNYSYWKPHMIFFIKTLDGKAWRVLVGGYEPPMVTVNEVLVPKPEINWTDAEEQASVGNARAINAIFKGVDLNVFKLINSCT
        MEIIREGPS S PPVLDGKNYSYWKP MIFFIKTLDGKAWR LVGGYE  +VTVN V VPKPEI+WTDAEEQASV NARAIN IF GVDLNVFKLINSCT
Subjt:  MEIIREGPSASHPPVLDGKNYSYWKPHMIFFIKTLDGKAWRVLVGGYEPPMVTVNEVLVPKPEINWTDAEEQASVGNARAINAIFKGVDLNVFKLINSCT

Query:  TAKEARKILEVAYEGTSKVKISRLQLITSKFEALKMTEDESVSEYNERVLEIANDSLLLGEKISESKIVCKVLRSLPRKLDMKVIAIEEAQDITTLKLDE
        TAKEA KIL+VAYEGTSKVKISRLQLITSKFEALKMTEDE+VSEYNERVLEI NDSLL  EKISESKIV KVL SLPRK D KV AIEEAQDITTLKLD+
Subjt:  TAKEARKILEVAYEGTSKVKISRLQLITSKFEALKMTEDESVSEYNERVLEIANDSLLLGEKISESKIVCKVLRSLPRKLDMKVIAIEEAQDITTLKLDE

Query:  LFGSLLTFEMA-SGNETNQDESIALLTKQFSKMARKFKSLNTAGKTKKTGRHDGENSRRKVNNFSYRRNSDHGKKKEDVGRSFRCKECEGFGHYQAECPT
        L G LLTFEMA S  ETNQDESIA+LTKQFSKMARKFKSLNTAG+++KTGRHDGENS RKVN+FSYRRNSDHGKKKEDVGRSFRC+ECEGFGHYQ ECPT
Subjt:  LFGSLLTFEMA-SGNETNQDESIALLTKQFSKMARKFKSLNTAGKTKKTGRHDGENSRRKVNNFSYRRNSDHGKKKEDVGRSFRCKECEGFGHYQAECPT

Query:  YLRRQKKNYCATLSDEDSDDDEDDH----------EINSEVDSECFNIDEDEELTLEELKMLRKEDSEARAIKKERIQDLMDENERLMRVLNS
        YLRR+ KNYCATL DED DD+EDDH          +INSE DSECF+ +EDEELTLEELKMLRKEDSEARAI+KERIQDLMD+NE+LM V++S
Subjt:  YLRRQKKNYCATLSDEDSDDDEDDH----------EINSEVDSECFNIDEDEELTLEELKMLRKEDSEARAIKKERIQDLMDENERLMRVLNS

XP_008454684.1 PREDICTED: uncharacterized protein LOC103495039 [Cucumis melo]6.8e-15661.1Show/hide
Query:  EINWTDAEEQASVGNARAINAIFKGVDLNVFKLINSCTTAKEARKILEVAYEGTSKVKISRLQLITSKFEALKMTEDESVSEYNERVLEIANDSLLLGEK
        +I+WTDAEEQASVGNARAIN IF GVDLNVFKLI+SCTTAKEA KILEVAYEGTSKVKI RLQL+TSKFEALKM EDE+ SEYNERVLEIANDSLLLGEK
Subjt:  EINWTDAEEQASVGNARAINAIFKGVDLNVFKLINSCTTAKEARKILEVAYEGTSKVKISRLQLITSKFEALKMTEDESVSEYNERVLEIANDSLLLGEK

Query:  ISESKIVCKVLRSLPRKLDMKVIAIEEAQDITTLKLDELFGSLLTFEMAS--------------------------GNETNQDESIALLTKQFSKMARKF
        I ESKIV KVLRSLPRK D+KV AIEEAQDITTLKLDELFGSLLTFEMA                           GNE NQDESIAL+ + FSKMARKF
Subjt:  ISESKIVCKVLRSLPRKLDMKVIAIEEAQDITTLKLDELFGSLLTFEMAS--------------------------GNETNQDESIALLTKQFSKMARKF

Query:  KSLNTAGKTKKTGRHDGENSRRKVNNFSYRRNSDHGKKKEDVGRSFRCKECEGFGHYQAECPTYLRRQKKNYCATLSDEDSDDDEDDHEINSEVDSECFN
        KSLNTAGKT                           +K EDV                AECPTYL+RQKKNYCATLSDEDSDDDEDDH    E DSEC +
Subjt:  KSLNTAGKTKKTGRHDGENSRRKVNNFSYRRNSDHGKKKEDVGRSFRCKECEGFGHYQAECPTYLRRQKKNYCATLSDEDSDDDEDDHEINSEVDSECFN

Query:  IDEDEELTLEELKMLRKEDSEARAIKKERIQDLMDENERLMRVLNS-------------------PDGSV------------------------------
        IDEDEELT EELK+LRKEDSEARAI+KERIQDLMDENERLM +++S                    +GS                               
Subjt:  IDEDEELTLEELKMLRKEDSEARAIKKERIQDLMDENERLMRVLNS-------------------PDGSV------------------------------

Query:  -------------------------------IIVAEECNVAFTTVQTHVDAWYFDSACSRHMTGNRSFFTELEECASGHVTFGDGAKGKIIAKGNIDKSN
                                       +  +E+CN+AFTTVQTH DAWYFDS CSRHMT NR FFTELEEC+S HVTFGDGAKGKIIAKGNIDKSN
Subjt:  -------------------------------IIVAEECNVAFTTVQTHVDAWYFDSACSRHMTGNRSFFTELEECASGHVTFGDGAKGKIIAKGNIDKSN

Query:  LPCLNEVRYVNGLKANLISISQLCDQRYSVNFNNTGCVVTDKNNQ
        LPCLNEVRYV+GL ANL S+S+LCDQ Y+VNFNNT C+VTDKNNQ
Subjt:  LPCLNEVRYVNGLKANLISISQLCDQRYSVNFNNTGCVVTDKNNQ

XP_016903608.1 PREDICTED: uncharacterized protein LOC107992254 [Cucumis melo]3.7e-16261.35Show/hide
Query:  MIFFIKTLDGKAWRVLVGGYEPPMVTVNEVLVPKPEINWTDAEEQASVGNARAINAIFKGVDLNVFKLINSCTTAKEARKILEVAYEGTSKVKISRLQLI
        MIFFIK LDGKAWRV+VGGYEPPM+TVN V VPKPEI+WTDAEE+ASVGNARAINA+F GVDLN+FKLINS TTAKEA KILEVAYEGTSKVKISRLQLI
Subjt:  MIFFIKTLDGKAWRVLVGGYEPPMVTVNEVLVPKPEINWTDAEEQASVGNARAINAIFKGVDLNVFKLINSCTTAKEARKILEVAYEGTSKVKISRLQLI

Query:  TSKFEALKMTEDESVSEYNERVLEIANDSLLLGEKISESKIVCKVLRSLPRKLDMKVIAIEEAQDITTLKLDELFGSLLTFEMA----------------
        TSKFEALKMTEDE+VSEYNERVLEIANDSLLL EKI ESKIVCKVLRSLPRK DMKV AIEEAQDITTLKLDELFGSLLTFEMA                
Subjt:  TSKFEALKMTEDESVSEYNERVLEIANDSLLLGEKISESKIVCKVLRSLPRKLDMKVIAIEEAQDITTLKLDELFGSLLTFEMA----------------

Query:  ----------SGNETNQDESIALLTKQFSKMARKFKSLNTAGKTKKTGRHDGENSRRKVNNFSYRRNSDHGKKKEDVGRSFRCKECEGFGHYQAECPTYL
                  SGNE NQDES+ALLTKQFSKMARK                               RNSDH KKKEDVG SFRC+ECEG GHYQAECP YL
Subjt:  ----------SGNETNQDESIALLTKQFSKMARKFKSLNTAGKTKKTGRHDGENSRRKVNNFSYRRNSDHGKKKEDVGRSFRCKECEGFGHYQAECPTYL

Query:  RRQKKNYCATLSDEDSDDDEDDH----------EINSEVDSECFNIDEDEELTLEELKMLRKEDSEARAIKKERIQDLMDENERLMRVLNS---------
         RQKKNYCATLSDE+SD+DEDDH          EINSE DSE  NI+EDEELTLEELK+LRKEDSEARAI+KERIQDLMDENERLM +++S         
Subjt:  RRQKKNYCATLSDEDSDDDEDDH----------EINSEVDSECFNIDEDEELTLEELKMLRKEDSEARAIKKERIQDLMDENERLMRVLNS---------

Query:  ---------------------------PDGSV-----------------------------------------------------------------IIV
                                    +GS                                                                  ++ 
Subjt:  ---------------------------PDGSV-----------------------------------------------------------------IIV

Query:  AEECNVAFTTVQTHVDAWYFDSACSRHMTGNRSFFTELEECASGHVTFGDGAKGKIIAKGNIDK
        + +CNVAFTTVQTHVDAWYFDS CSR MTGNRSFFTELEEC SGHVTF DGAKGKIIAKGNIDK
Subjt:  AEECNVAFTTVQTHVDAWYFDSACSRHMTGNRSFFTELEECASGHVTFGDGAKGKIIAKGNIDK

TrEMBL top hitse value%identityAlignment
A0A1S3BZ69 uncharacterized protein LOC1034950393.3e-15661.1Show/hide
Query:  EINWTDAEEQASVGNARAINAIFKGVDLNVFKLINSCTTAKEARKILEVAYEGTSKVKISRLQLITSKFEALKMTEDESVSEYNERVLEIANDSLLLGEK
        +I+WTDAEEQASVGNARAIN IF GVDLNVFKLI+SCTTAKEA KILEVAYEGTSKVKI RLQL+TSKFEALKM EDE+ SEYNERVLEIANDSLLLGEK
Subjt:  EINWTDAEEQASVGNARAINAIFKGVDLNVFKLINSCTTAKEARKILEVAYEGTSKVKISRLQLITSKFEALKMTEDESVSEYNERVLEIANDSLLLGEK

Query:  ISESKIVCKVLRSLPRKLDMKVIAIEEAQDITTLKLDELFGSLLTFEMAS--------------------------GNETNQDESIALLTKQFSKMARKF
        I ESKIV KVLRSLPRK D+KV AIEEAQDITTLKLDELFGSLLTFEMA                           GNE NQDESIAL+ + FSKMARKF
Subjt:  ISESKIVCKVLRSLPRKLDMKVIAIEEAQDITTLKLDELFGSLLTFEMAS--------------------------GNETNQDESIALLTKQFSKMARKF

Query:  KSLNTAGKTKKTGRHDGENSRRKVNNFSYRRNSDHGKKKEDVGRSFRCKECEGFGHYQAECPTYLRRQKKNYCATLSDEDSDDDEDDHEINSEVDSECFN
        KSLNTAGKT                           +K EDV                AECPTYL+RQKKNYCATLSDEDSDDDEDDH    E DSEC +
Subjt:  KSLNTAGKTKKTGRHDGENSRRKVNNFSYRRNSDHGKKKEDVGRSFRCKECEGFGHYQAECPTYLRRQKKNYCATLSDEDSDDDEDDHEINSEVDSECFN

Query:  IDEDEELTLEELKMLRKEDSEARAIKKERIQDLMDENERLMRVLNS-------------------PDGSV------------------------------
        IDEDEELT EELK+LRKEDSEARAI+KERIQDLMDENERLM +++S                    +GS                               
Subjt:  IDEDEELTLEELKMLRKEDSEARAIKKERIQDLMDENERLMRVLNS-------------------PDGSV------------------------------

Query:  -------------------------------IIVAEECNVAFTTVQTHVDAWYFDSACSRHMTGNRSFFTELEECASGHVTFGDGAKGKIIAKGNIDKSN
                                       +  +E+CN+AFTTVQTH DAWYFDS CSRHMT NR FFTELEEC+S HVTFGDGAKGKIIAKGNIDKSN
Subjt:  -------------------------------IIVAEECNVAFTTVQTHVDAWYFDSACSRHMTGNRSFFTELEECASGHVTFGDGAKGKIIAKGNIDKSN

Query:  LPCLNEVRYVNGLKANLISISQLCDQRYSVNFNNTGCVVTDKNNQ
        LPCLNEVRYV+GL ANL S+S+LCDQ Y+VNFNNT C+VTDKNNQ
Subjt:  LPCLNEVRYVNGLKANLISISQLCDQRYSVNFNNTGCVVTDKNNQ

A0A1S4E5V5 uncharacterized protein LOC1079922541.8e-16261.35Show/hide
Query:  MIFFIKTLDGKAWRVLVGGYEPPMVTVNEVLVPKPEINWTDAEEQASVGNARAINAIFKGVDLNVFKLINSCTTAKEARKILEVAYEGTSKVKISRLQLI
        MIFFIK LDGKAWRV+VGGYEPPM+TVN V VPKPEI+WTDAEE+ASVGNARAINA+F GVDLN+FKLINS TTAKEA KILEVAYEGTSKVKISRLQLI
Subjt:  MIFFIKTLDGKAWRVLVGGYEPPMVTVNEVLVPKPEINWTDAEEQASVGNARAINAIFKGVDLNVFKLINSCTTAKEARKILEVAYEGTSKVKISRLQLI

Query:  TSKFEALKMTEDESVSEYNERVLEIANDSLLLGEKISESKIVCKVLRSLPRKLDMKVIAIEEAQDITTLKLDELFGSLLTFEMA----------------
        TSKFEALKMTEDE+VSEYNERVLEIANDSLLL EKI ESKIVCKVLRSLPRK DMKV AIEEAQDITTLKLDELFGSLLTFEMA                
Subjt:  TSKFEALKMTEDESVSEYNERVLEIANDSLLLGEKISESKIVCKVLRSLPRKLDMKVIAIEEAQDITTLKLDELFGSLLTFEMA----------------

Query:  ----------SGNETNQDESIALLTKQFSKMARKFKSLNTAGKTKKTGRHDGENSRRKVNNFSYRRNSDHGKKKEDVGRSFRCKECEGFGHYQAECPTYL
                  SGNE NQDES+ALLTKQFSKMARK                               RNSDH KKKEDVG SFRC+ECEG GHYQAECP YL
Subjt:  ----------SGNETNQDESIALLTKQFSKMARKFKSLNTAGKTKKTGRHDGENSRRKVNNFSYRRNSDHGKKKEDVGRSFRCKECEGFGHYQAECPTYL

Query:  RRQKKNYCATLSDEDSDDDEDDH----------EINSEVDSECFNIDEDEELTLEELKMLRKEDSEARAIKKERIQDLMDENERLMRVLNS---------
         RQKKNYCATLSDE+SD+DEDDH          EINSE DSE  NI+EDEELTLEELK+LRKEDSEARAI+KERIQDLMDENERLM +++S         
Subjt:  RRQKKNYCATLSDEDSDDDEDDH----------EINSEVDSECFNIDEDEELTLEELKMLRKEDSEARAIKKERIQDLMDENERLMRVLNS---------

Query:  ---------------------------PDGSV-----------------------------------------------------------------IIV
                                    +GS                                                                  ++ 
Subjt:  ---------------------------PDGSV-----------------------------------------------------------------IIV

Query:  AEECNVAFTTVQTHVDAWYFDSACSRHMTGNRSFFTELEECASGHVTFGDGAKGKIIAKGNIDK
        + +CNVAFTTVQTHVDAWYFDS CSR MTGNRSFFTELEEC SGHVTF DGAKGKIIAKGNIDK
Subjt:  AEECNVAFTTVQTHVDAWYFDSACSRHMTGNRSFFTELEECASGHVTFGDGAKGKIIAKGNIDK

A0A5A7V0X1 Gag-pol polyprotein2.5e-16474.83Show/hide
Query:  MVTVNEVLVPKPEINWTDAEEQASVGNARAINAIFKGVDLNVFKLINSCTTAKEARKILEVAYEGTSKVKISRLQLITSKFEALKMTEDESVSEYNERVL
        M+TVN V VPK EI WTDA+EQA VGNARAINAIF  VDLNVFKLINSCTTAKEA KILEVAYEGTSK KISRLQLITSKFEALKMTEDE VSEYNERVL
Subjt:  MVTVNEVLVPKPEINWTDAEEQASVGNARAINAIFKGVDLNVFKLINSCTTAKEARKILEVAYEGTSKVKISRLQLITSKFEALKMTEDESVSEYNERVL

Query:  EIANDSLLLGEKISESKIVCKVLRSLPRKLDMKVIAIEEAQDITTLKLDELFGSLLTFEMASGNETNQDESIALLTKQFSKMARKFKSLNTAGKTKKTGR
        +IAN+SLLLGEKI E KIV KVLRSLPRK DMKV A EEAQ+ITTLKLDELFGSLLTFEM S   ++++   A LTKQFSKMAR FK LNT GKT+KT R
Subjt:  EIANDSLLLGEKISESKIVCKVLRSLPRKLDMKVIAIEEAQDITTLKLDELFGSLLTFEMASGNETNQDESIALLTKQFSKMARKFKSLNTAGKTKKTGR

Query:  HDGENSRRKVNNFSYRRNSDHGKKKEDVGRSFRCKECEGFGHYQAECPTYLRRQKKNYCATLSDEDSDDDEDDH----------EINSEVDSECFNIDED
        HD ENS RKVN+ SYRRNSDHGKK EDVGRSFRC+EC+GFGHYQ +CPTYL+RQKKNYC+TLSDEDSDDDEDDH          +INSE DSECF+IDED
Subjt:  HDGENSRRKVNNFSYRRNSDHGKKKEDVGRSFRCKECEGFGHYQAECPTYLRRQKKNYCATLSDEDSDDDEDDH----------EINSEVDSECFNIDED

Query:  EELTLEELKMLRKEDSEARAIKKERIQDLMDENERLMRVLNS------PDGS-----------VIIVAEECNVAFTTVQTHVDAWYFDSACSRHMTGNRS
        EELTLEEL +LRKE SEARAI+KERIQDLMDENERLM +++S      P+             ++  +E+CNVAFTTVQTHVDAWYFDS  SRHMT NRS
Subjt:  EELTLEELKMLRKEDSEARAIKKERIQDLMDENERLMRVLNS------PDGS-----------VIIVAEECNVAFTTVQTHVDAWYFDSACSRHMTGNRS

Query:  FFTELEECASGHVTFGDGAKGKIIAKGNIDKSNLPCLNEVR
        FFTELEECASGHVTFGDGAKGKII K   DK+N   +++ R
Subjt:  FFTELEECASGHVTFGDGAKGKIIAKGNIDKSNLPCLNEVR

A0A5D3D1H0 Gag-proteinase polyprotein3.9e-15776.79Show/hide
Query:  MEIIREGPSASHPPVLDGKNYSYWKPHMIFFIKTLDGKAWRVLVGGYEPPMVTVNEVLVPKPEINWTDAEEQASVGNARAINAIFKGVDLNVFKLINSCT
        MEIIREGPSAS  PVLDGKNYSYWKP MIFFIKTLDGKAWR LVGGYEPPMVTVN V VPKPE++WTDAEEQASVGNARAINAIF  VDLN FKLINSCT
Subjt:  MEIIREGPSASHPPVLDGKNYSYWKPHMIFFIKTLDGKAWRVLVGGYEPPMVTVNEVLVPKPEINWTDAEEQASVGNARAINAIFKGVDLNVFKLINSCT

Query:  TAKEARKILEVAYEGTSKVKISRLQLITSKFEALKMTEDESVSEYNERVLEIANDSLLLGEKISESKIVCKVLRSLPRKLDMKVIAIEEAQDITTLKLDE
        TAKEA KILEVAYEGTSKVKISRLQLITSKFEA KM EDESVS+YNERVLEIANDSLLLGEKI ESKIV KVLRSLPRK DMKV AIEEA+DITT+KLDE
Subjt:  TAKEARKILEVAYEGTSKVKISRLQLITSKFEALKMTEDESVSEYNERVLEIANDSLLLGEKISESKIVCKVLRSLPRKLDMKVIAIEEAQDITTLKLDE

Query:  LFGSLLTFEMASG-----------------------NETNQDESIALLTKQFSKMARKFKSLNTAGKTKKTGRHDGENSRRKVNNFSYRRNSDHGKKKED
        LFGSLLTFEMA                         N+ NQDESIALLTKQFSKMARKFKSLNTAG+T+KTGRHDGENS RKVN+ SYRRN+DH KK ED
Subjt:  LFGSLLTFEMASG-----------------------NETNQDESIALLTKQFSKMARKFKSLNTAGKTKKTGRHDGENSRRKVNNFSYRRNSDHGKKKED

Query:  VGRSFRCKECEGFGHYQAECPTYLRRQKKNYCATLSDEDSDDDEDDHEINSEVDSECFNIDEDEELTLEELKMLRKEDSEARAIKKERIQDLMDENERLM
        VGRSFRCKECEGFGHYQAE    +      + A ++           EINSEVDSECF+ DEDEELTLE+LKMLRKEDSEARAI+KERIQ+LM+ENERL+
Subjt:  VGRSFRCKECEGFGHYQAECPTYLRRQKKNYCATLSDEDSDDDEDDHEINSEVDSECFNIDEDEELTLEELKMLRKEDSEARAIKKERIQDLMDENERLM

Query:  RVLNS
         +++S
Subjt:  RVLNS

A0A5D3DG33 Gag-pol polyprotein1.2e-17183.72Show/hide
Query:  MEIIREGPSASHPPVLDGKNYSYWKPHMIFFIKTLDGKAWRVLVGGYEPPMVTVNEVLVPKPEINWTDAEEQASVGNARAINAIFKGVDLNVFKLINSCT
        MEIIREGPS S PPVLDGKNYSYWKP MIFFIKTLDGKAWR LVGGYE  +VTVN V VPKPEI+WTDAEEQASV NARAIN IF GVDLNVFKLINSCT
Subjt:  MEIIREGPSASHPPVLDGKNYSYWKPHMIFFIKTLDGKAWRVLVGGYEPPMVTVNEVLVPKPEINWTDAEEQASVGNARAINAIFKGVDLNVFKLINSCT

Query:  TAKEARKILEVAYEGTSKVKISRLQLITSKFEALKMTEDESVSEYNERVLEIANDSLLLGEKISESKIVCKVLRSLPRKLDMKVIAIEEAQDITTLKLDE
        TAKEA KIL+VAYEGTSKVKISRLQLITSKFEALKMTEDE+VSEYNERVLEI NDSLL  EKISESKIV KVL SLPRK D KV AIEEAQDITTLKLD+
Subjt:  TAKEARKILEVAYEGTSKVKISRLQLITSKFEALKMTEDESVSEYNERVLEIANDSLLLGEKISESKIVCKVLRSLPRKLDMKVIAIEEAQDITTLKLDE

Query:  LFGSLLTFEMA-SGNETNQDESIALLTKQFSKMARKFKSLNTAGKTKKTGRHDGENSRRKVNNFSYRRNSDHGKKKEDVGRSFRCKECEGFGHYQAECPT
        L G LLTFEMA S  ETNQDESIA+LTKQFSKMARKFKSLNTAG+++KTGRHDGENS RKVN+FSYRRNSDHGKKKEDVGRSFRC+ECEGFGHYQ ECPT
Subjt:  LFGSLLTFEMA-SGNETNQDESIALLTKQFSKMARKFKSLNTAGKTKKTGRHDGENSRRKVNNFSYRRNSDHGKKKEDVGRSFRCKECEGFGHYQAECPT

Query:  YLRRQKKNYCATLSDEDSDDDEDDH----------EINSEVDSECFNIDEDEELTLEELKMLRKEDSEARAIKKERIQDLMDENERLMRVLNS
        YLRR+ KNYCATL DED DD+EDDH          +INSE DSECF+ +EDEELTLEELKMLRKEDSEARAI+KERIQDLMD+NE+LM V++S
Subjt:  YLRRQKKNYCATLSDEDSDDDEDDH----------EINSEVDSECFNIDEDEELTLEELKMLRKEDSEARAIKKERIQDLMDENERLMRVLNS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G20980.1 Gag-Pol-related retrotransposon family protein1.2e-0430Show/hide
Query:  FTTVQTHVDAWYFDSACSRHMTGNRSFFTELEECASGHVTFGDGAKGK---IIAKG-----NIDKSNLPCLNEVRYVNGLKANLISISQLCDQRYSVNF-
        F+    H + W   S  S HMT +  FFT L+      V F  G K +    + +G      I       +  V YV G++ N +S+SQL    + V+  
Subjt:  FTTVQTHVDAWYFDSACSRHMTGNRSFFTELEECASGHVTFGDGAKGK---IIAKG-----NIDKSNLPCLNEVRYVNGLKANLISISQLCDQRYSVNF-

Query:  NNTGCVVTDK
          TGC V D+
Subjt:  NNTGCVVTDK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATAATCAGAGAGGGACCATCAGCTTCACATCCTCCTGTACTGGATGGTAAAAATTACTCTTATTGGAAGCCTCACATGATATTTTTCATTAAAACCTTGGATGG
AAAAGCATGGAGAGTACTTGTTGGTGGTTATGAACCTCCAATGGTCACTGTGAATGAAGTATTAGTGCCAAAACCAGAAATTAACTGGACAGATGCTGAAGAACAAGCTT
CAGTTGGAAATGCAAGAGCCATAAATGCTATCTTCAAAGGTGTCGATTTAAACGTGTTCAAACTTATCAATTCCTGCACTACTGCTAAAGAAGCTCGAAAAATATTGGAA
GTTGCATATGAAGGAACTTCTAAAGTGAAGATATCCAGACTGCAGTTGATAACTTCAAAATTCGAAGCCTTGAAAATGACTGAAGATGAGTCAGTTTCTGAATACAATGA
GAGGGTCCTGGAGATAGCTAATGATTCGCTACTACTTGGTGAAAAGATATCTGAGTCTAAAATTGTTTGCAAAGTGTTACGCTCTTTACCCAGAAAGCTTGACATGAAGG
TCATTGCCATAGAAGAAGCCCAAGATATAACGACGTTAAAACTTGACGAGTTATTTGGATCGCTACTTACGTTTGAAATGGCTTCTGGTAATGAAACTAATCAAGATGAG
TCAATAGCTCTCTTGACGAAGCAATTCTCTAAGATGGCCAGGAAGTTCAAAAGTTTGAATACTGCTGGAAAAACTAAAAAAACTGGAAGACATGATGGTGAGAACTCTAG
AAGAAAGGTTAATAACTTCTCTTACAGAAGAAATAGCGACCATGGTAAGAAAAAGGAGGATGTAGGGAGGTCGTTTAGATGTAAAGAATGTGAAGGATTTGGTCATTATC
AGGCCGAATGTCCCACTTATCTCAGAAGACAAAAGAAAAATTATTGTGCTACCCTGTCTGATGAGGACTCAGATGATGATGAAGATGATCATGAAATCAATTCTGAAGTT
GATAGTGAGTGTTTCAATATTGATGAGGATGAAGAGCTAACACTTGAAGAACTCAAAATGCTGAGGAAGGAAGACTCAGAAGCCAGAGCTATTAAAAAAGAAAGAATTCA
AGATTTGATGGACGAAAATGAACGATTGATGAGAGTGCTAAATTCTCCAGATGGGTCTGTTATTATTGTGGCAGAAGAGTGCAATGTTGCATTTACAACAGTTCAAACCC
ATGTTGATGCTTGGTACTTCGACAGTGCATGCTCAAGACATATGACTGGCAATCGATCTTTTTTTACTGAGTTAGAAGAGTGTGCCTCTGGTCATGTCACTTTTGGAGAT
GGAGCCAAAGGAAAAATTATTGCAAAAGGAAACATTGACAAAAGTAATCTACCCTGTCTAAATGAAGTTAGATATGTGAATGGACTGAAGGCAAACTTGATTAGTATAAG
TCAACTATGTGACCAAAGATACAGTGTAAACTTTAACAACACTGGCTGTGTAGTTACAGACAAGAATAATCAGTGTTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGATAATCAGAGAGGGACCATCAGCTTCACATCCTCCTGTACTGGATGGTAAAAATTACTCTTATTGGAAGCCTCACATGATATTTTTCATTAAAACCTTGGATGG
AAAAGCATGGAGAGTACTTGTTGGTGGTTATGAACCTCCAATGGTCACTGTGAATGAAGTATTAGTGCCAAAACCAGAAATTAACTGGACAGATGCTGAAGAACAAGCTT
CAGTTGGAAATGCAAGAGCCATAAATGCTATCTTCAAAGGTGTCGATTTAAACGTGTTCAAACTTATCAATTCCTGCACTACTGCTAAAGAAGCTCGAAAAATATTGGAA
GTTGCATATGAAGGAACTTCTAAAGTGAAGATATCCAGACTGCAGTTGATAACTTCAAAATTCGAAGCCTTGAAAATGACTGAAGATGAGTCAGTTTCTGAATACAATGA
GAGGGTCCTGGAGATAGCTAATGATTCGCTACTACTTGGTGAAAAGATATCTGAGTCTAAAATTGTTTGCAAAGTGTTACGCTCTTTACCCAGAAAGCTTGACATGAAGG
TCATTGCCATAGAAGAAGCCCAAGATATAACGACGTTAAAACTTGACGAGTTATTTGGATCGCTACTTACGTTTGAAATGGCTTCTGGTAATGAAACTAATCAAGATGAG
TCAATAGCTCTCTTGACGAAGCAATTCTCTAAGATGGCCAGGAAGTTCAAAAGTTTGAATACTGCTGGAAAAACTAAAAAAACTGGAAGACATGATGGTGAGAACTCTAG
AAGAAAGGTTAATAACTTCTCTTACAGAAGAAATAGCGACCATGGTAAGAAAAAGGAGGATGTAGGGAGGTCGTTTAGATGTAAAGAATGTGAAGGATTTGGTCATTATC
AGGCCGAATGTCCCACTTATCTCAGAAGACAAAAGAAAAATTATTGTGCTACCCTGTCTGATGAGGACTCAGATGATGATGAAGATGATCATGAAATCAATTCTGAAGTT
GATAGTGAGTGTTTCAATATTGATGAGGATGAAGAGCTAACACTTGAAGAACTCAAAATGCTGAGGAAGGAAGACTCAGAAGCCAGAGCTATTAAAAAAGAAAGAATTCA
AGATTTGATGGACGAAAATGAACGATTGATGAGAGTGCTAAATTCTCCAGATGGGTCTGTTATTATTGTGGCAGAAGAGTGCAATGTTGCATTTACAACAGTTCAAACCC
ATGTTGATGCTTGGTACTTCGACAGTGCATGCTCAAGACATATGACTGGCAATCGATCTTTTTTTACTGAGTTAGAAGAGTGTGCCTCTGGTCATGTCACTTTTGGAGAT
GGAGCCAAAGGAAAAATTATTGCAAAAGGAAACATTGACAAAAGTAATCTACCCTGTCTAAATGAAGTTAGATATGTGAATGGACTGAAGGCAAACTTGATTAGTATAAG
TCAACTATGTGACCAAAGATACAGTGTAAACTTTAACAACACTGGCTGTGTAGTTACAGACAAGAATAATCAGTGTTCATGA
Protein sequenceShow/hide protein sequence
MEIIREGPSASHPPVLDGKNYSYWKPHMIFFIKTLDGKAWRVLVGGYEPPMVTVNEVLVPKPEINWTDAEEQASVGNARAINAIFKGVDLNVFKLINSCTTAKEARKILE
VAYEGTSKVKISRLQLITSKFEALKMTEDESVSEYNERVLEIANDSLLLGEKISESKIVCKVLRSLPRKLDMKVIAIEEAQDITTLKLDELFGSLLTFEMASGNETNQDE
SIALLTKQFSKMARKFKSLNTAGKTKKTGRHDGENSRRKVNNFSYRRNSDHGKKKEDVGRSFRCKECEGFGHYQAECPTYLRRQKKNYCATLSDEDSDDDEDDHEINSEV
DSECFNIDEDEELTLEELKMLRKEDSEARAIKKERIQDLMDENERLMRVLNSPDGSVIIVAEECNVAFTTVQTHVDAWYFDSACSRHMTGNRSFFTELEECASGHVTFGD
GAKGKIIAKGNIDKSNLPCLNEVRYVNGLKANLISISQLCDQRYSVNFNNTGCVVTDKNNQCS