; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy6G015548 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy6G015548
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionGag protease polyprotein
Genome locationGy14Chr6:14390606..14391757
RNA-Seq ExpressionCsGy6G015548
SyntenyCsGy6G015548
Gene Ontology termsGO:0006885 - regulation of pH (biological process)
GO:1902600 - proton transmembrane transport (biological process)
GO:0012505 - endomembrane system (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008233 - peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0015299 - solute:proton antiporter activity (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR005162 - Retrotransposon gag domain
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN57866.2 hypothetical protein Csa_011500 [Cucumis sativus]2.60e-23986.41Show/hide
Query:  MPPKRGVRRGGRRGRGKGAG-------RNQPTEGQAEQRVPTAPVTHVEFAALSAHMEQRFTELMTAIAQNQQAPAVPPAPVIPPVPAAPPAPTTPPAQG
        MPP+  VRRGGRRGRG+GAG       RNQPTEGQAE R+P APVTHVEF ALSAHMEQRFTELMTAIAQNQQAPAVPPAPV+PP PAAPPA        
Subjt:  MPPKRGVRRGGRRGRGKGAG-------RNQPTEGQAEQRVPTAPVTHVEFAALSAHMEQRFTELMTAIAQNQQAPAVPPAPVIPPVPAAPPAPTTPPAQG

Query:  LAAQQPQILPNQLSAEAKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTMCMLSGDVRQVTWDQFKDC
              Q LPNQLSAEAKHLRDFRKYDPQTFDGSLEDPTKAE+WLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTM ML GDVRQ+TWDQFKDC
Subjt:  LAAQQPQILPNQLSAEAKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTMCMLSGDVRQVTWDQFKDC

Query:  FYTKFFSANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSRFVPELVGNEQARADRFIKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIRARSFDK
        FYTKFFSANLRDAKSQEFLELKQG+MTVEEYDQEFDMLSRF PELV NEQARADRF+KGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDE + RSF+K
Subjt:  FYTKFFSANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSRFVPELVGNEQARADRFIKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIRARSFDK

Query:  GSSSGQKRKAEQRTVGVPQRNLRSGDPFRSFQQSSGGAGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGQSS
        GSSSGQKRK EQRTVGVPQRN+R GD FRSFQQSSGGAGDTT+E+P+C+TCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAG SS
Subjt:  GSSSGQKRKAEQRTVGVPQRNLRSGDPFRSFQQSSGGAGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGQSS

XP_031741726.1 uncharacterized protein LOC116403920 [Cucumis sativus]4.69e-26193.33Show/hide
Query:  MPPKRGVRRGGRRGRGKGAG-------RNQPTEGQAEQRVPTAPVTHVEFAALSAHMEQRFTELMTAIAQNQQAPAVPPAPVIPPVPAAPPAPTTPPAQG
        MPP+ GVRRGGRRGRG+GAG       RNQPTEGQAEQR+P APVTHVEF ALSAHMEQRFTELMTAIA+NQQAPAVPPAPV+PPVPAAPPAP  PPAQG
Subjt:  MPPKRGVRRGGRRGRGKGAG-------RNQPTEGQAEQRVPTAPVTHVEFAALSAHMEQRFTELMTAIAQNQQAPAVPPAPVIPPVPAAPPAPTTPPAQG

Query:  LAAQQPQILPNQLSAEAKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTMCMLSGDVRQVTWDQFKDC
        LAAQQPQILPNQLSAEAKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTM ML GDVRQ+TWDQFK+C
Subjt:  LAAQQPQILPNQLSAEAKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTMCMLSGDVRQVTWDQFKDC

Query:  FYTKFFSANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSRFVPELVGNEQARADRFIKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIRARSFDK
        FYTKFFSANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSRF PELVGNEQARADRF+KGLRDEIR FVRALKPTTQAEALRLAVDMSIGKDEIR RSFDK
Subjt:  FYTKFFSANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSRFVPELVGNEQARADRFIKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIRARSFDK

Query:  GSSSGQKRKAEQRTVGVPQRNLRSGDPFRSFQQSSGGAGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGQSS
        GSSSGQKRKAEQRTVGVPQRNLR GDPFRSFQQSSGGAGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGQSS
Subjt:  GSSSGQKRKAEQRTVGVPQRNLRSGDPFRSFQQSSGGAGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGQSS

XP_031742890.1 uncharacterized protein LOC116404512 [Cucumis sativus]1.67e-26193.33Show/hide
Query:  MPPKRGVRRGGRRGRGKGAG-------RNQPTEGQAEQRVPTAPVTHVEFAALSAHMEQRFTELMTAIAQNQQAPAVPPAPVIPPVPAAPPAPTTPPAQG
        MPP+ GVRRGGRRGRG+GAG       RNQPTEGQAEQR+P APVTHVEF ALSAHMEQRFTELMTAIA+NQQAPAVPPAPV+PPVPAAPPAP  PPAQG
Subjt:  MPPKRGVRRGGRRGRGKGAG-------RNQPTEGQAEQRVPTAPVTHVEFAALSAHMEQRFTELMTAIAQNQQAPAVPPAPVIPPVPAAPPAPTTPPAQG

Query:  LAAQQPQILPNQLSAEAKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTMCMLSGDVRQVTWDQFKDC
        LAAQQPQILPNQLSAEAKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTM ML GDVRQ+TWDQFK+C
Subjt:  LAAQQPQILPNQLSAEAKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTMCMLSGDVRQVTWDQFKDC

Query:  FYTKFFSANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSRFVPELVGNEQARADRFIKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIRARSFDK
        FYTKFFSANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSRF PELVGNEQARADRF+KGLRDEIR FVRALKPTTQAEALRLAVDMSIGKDEIR RSFDK
Subjt:  FYTKFFSANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSRFVPELVGNEQARADRFIKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIRARSFDK

Query:  GSSSGQKRKAEQRTVGVPQRNLRSGDPFRSFQQSSGGAGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGQSS
        GSSSGQKRKAEQRTVGVPQRNLR GDPFRSFQQSSGGAGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGQSS
Subjt:  GSSSGQKRKAEQRTVGVPQRNLRSGDPFRSFQQSSGGAGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGQSS

XP_031743557.1 uncharacterized protein LOC116404620 isoform X1 [Cucumis sativus]4.53e-278100Show/hide
Query:  MPPKRGVRRGGRRGRGKGAGRNQPTEGQAEQRVPTAPVTHVEFAALSAHMEQRFTELMTAIAQNQQAPAVPPAPVIPPVPAAPPAPTTPPAQGLAAQQPQ
        MPPKRGVRRGGRRGRGKGAGRNQPTEGQAEQRVPTAPVTHVEFAALSAHMEQRFTELMTAIAQNQQAPAVPPAPVIPPVPAAPPAPTTPPAQGLAAQQPQ
Subjt:  MPPKRGVRRGGRRGRGKGAGRNQPTEGQAEQRVPTAPVTHVEFAALSAHMEQRFTELMTAIAQNQQAPAVPPAPVIPPVPAAPPAPTTPPAQGLAAQQPQ

Query:  ILPNQLSAEAKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTMCMLSGDVRQVTWDQFKDCFYTKFFS
        ILPNQLSAEAKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTMCMLSGDVRQVTWDQFKDCFYTKFFS
Subjt:  ILPNQLSAEAKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTMCMLSGDVRQVTWDQFKDCFYTKFFS

Query:  ANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSRFVPELVGNEQARADRFIKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIRARSFDKGSSSGQK
        ANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSRFVPELVGNEQARADRFIKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIRARSFDKGSSSGQK
Subjt:  ANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSRFVPELVGNEQARADRFIKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIRARSFDKGSSSGQK

Query:  RKAEQRTVGVPQRNLRSGDPFRSFQQSSGGAGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGQSS
        RKAEQRTVGVPQRNLRSGDPFRSFQQSSGGAGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGQSS
Subjt:  RKAEQRTVGVPQRNLRSGDPFRSFQQSSGGAGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGQSS

XP_031743561.1 uncharacterized protein LOC116404620 isoform X2 [Cucumis sativus]2.38e-278100Show/hide
Query:  MPPKRGVRRGGRRGRGKGAGRNQPTEGQAEQRVPTAPVTHVEFAALSAHMEQRFTELMTAIAQNQQAPAVPPAPVIPPVPAAPPAPTTPPAQGLAAQQPQ
        MPPKRGVRRGGRRGRGKGAGRNQPTEGQAEQRVPTAPVTHVEFAALSAHMEQRFTELMTAIAQNQQAPAVPPAPVIPPVPAAPPAPTTPPAQGLAAQQPQ
Subjt:  MPPKRGVRRGGRRGRGKGAGRNQPTEGQAEQRVPTAPVTHVEFAALSAHMEQRFTELMTAIAQNQQAPAVPPAPVIPPVPAAPPAPTTPPAQGLAAQQPQ

Query:  ILPNQLSAEAKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTMCMLSGDVRQVTWDQFKDCFYTKFFS
        ILPNQLSAEAKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTMCMLSGDVRQVTWDQFKDCFYTKFFS
Subjt:  ILPNQLSAEAKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTMCMLSGDVRQVTWDQFKDCFYTKFFS

Query:  ANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSRFVPELVGNEQARADRFIKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIRARSFDKGSSSGQK
        ANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSRFVPELVGNEQARADRFIKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIRARSFDKGSSSGQK
Subjt:  ANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSRFVPELVGNEQARADRFIKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIRARSFDKGSSSGQK

Query:  RKAEQRTVGVPQRNLRSGDPFRSFQQSSGGAGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGQSS
        RKAEQRTVGVPQRNLRSGDPFRSFQQSSGGAGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGQSS
Subjt:  RKAEQRTVGVPQRNLRSGDPFRSFQQSSGGAGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGQSS

TrEMBL top hitse value%identityAlignment
A0A5A7SQP7 Gag protease polyprotein6.80e-15360.78Show/hide
Query:  MPPKRGVRRGGRRGRGKGAGRNQP-TEGQAEQRVPTAPVTHVEFAALSAHMEQRFTELMTAIAQNQQA--PAVPPAPVIPPVPAAPPAPTTPPAQGLAAQ
        MPP+RG RRGGR GRG+GAGR QP  +  A+   P APVTH + AA    MEQRF +L+  + + QQ   PA  PAP + P PA  P P           
Subjt:  MPPKRGVRRGGRRGRGKGAGRNQP-TEGQAEQRVPTAPVTHVEFAALSAHMEQRFTELMTAIAQNQQA--PAVPPAPVIPPVPAAPPAPTTPPAQGLAAQ

Query:  QPQILPNQLSAEAKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTMCMLSGDVRQVTWDQFKDCFYTK
         PQ++P+QLSAEAKHLRDFRKY+P TFDGSLEDPT+A+LWLSS+ TIF YM+CPE+ +VQCA FLL DRG  WW TT  ML GDV Q+TW QFK+ FY K
Subjt:  QPQILPNQLSAEAKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTMCMLSGDVRQVTWDQFKDCFYTK

Query:  FFSANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSRFVPELVGNEQARADRFIKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIRARSFDKGSSS
        FFSA+LRDAK QEFL L+QG MTVE+YD EFDMLSRF PE++  E ARAD+F++GL+ +I+G VRA +P T A+ALRLAVD+S+ +    +++  +GS+S
Subjt:  FFSANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSRFVPELVGNEQARADRFIKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIRARSFDKGSSS

Query:  GQKRKAEQRTVGVPQRNLRSGDPFRSFQQSSGGAGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGQS
         QKRKAEQ+ V VPQRN RSG  FR FQQ    AG+  R KPLC TCGK HLGRCL GTR C+KC+QEGH ADRCPLR TG  Q+
Subjt:  GQKRKAEQRTVGVPQRNLRSGDPFRSFQQSSGGAGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGQS

A0A5A7STC8 Gag protease polyprotein2.23e-15462.14Show/hide
Query:  MPPKRGVRRGGRRGRGKGAGRNQP-TEGQAEQRVPTAPVTHVEFAALSAHMEQRFTELMTAIAQNQQAPAVPPAPVIPPVPAAPPAPTTPPAQGLAAQQP
        MPP+RG RRGGR GRG+GAGR QP  +  A+   P APVTH + AA    MEQRF +L+  + + QQ PA P  P + P PA  PAP   PA       P
Subjt:  MPPKRGVRRGGRRGRGKGAGRNQP-TEGQAEQRVPTAPVTHVEFAALSAHMEQRFTELMTAIAQNQQAPAVPPAPVIPPVPAAPPAPTTPPAQGLAAQQP

Query:  QILPNQLSAEAKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTMCMLSGDVRQVTWDQFKDCFYTKFF
        Q +P+QLSAEAKHLRDFRKY+P TFDGSLEDPT+A++WLSS+ETIF YM+CPE+ +VQCA F+L DRG  WW TT  ML GDV Q+TW QFK+ FY KFF
Subjt:  QILPNQLSAEAKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTMCMLSGDVRQVTWDQFKDCFYTKFF

Query:  SANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSRFVPELVGNEQARADRFIKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIRARSFDKGSSSGQ
        SA+LRDAK QEFL L+QG MTVE YD EFDMLSRF PE++  E ARAD+F++GLR +I+G VRA +P T A+ALRLAVD+S+ +    +++  +GS+SGQ
Subjt:  SANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSRFVPELVGNEQARADRFIKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIRARSFDKGSSSGQ

Query:  KRKAEQRTVGVPQRNLRSGDPFRSFQQSSGGAGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGQS
        KRKAEQ+ V VPQRN RSG  FRSFQQ    AG+  R KPLC TCGK HLGRCL+GTR C+KC+QEGH ADRCPLR TG  Q+
Subjt:  KRKAEQRTVGVPQRNLRSGDPFRSFQQSSGGAGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGQS

A0A5A7T3M7 Gag protease polyprotein1.01e-15360.95Show/hide
Query:  MPPKRGVRRGGRRGRGKGAGRNQP-TEGQAEQRVPTAPVTHVEFAALSAHMEQRFTELMTAIAQNQQAPAVPPAPVIPPVPAAPPAPTTPPAQGLAAQQP
        MPP+RG RRGGR GRG+GAGR QP  +  A+   P APVTH + AA    MEQRF +++  + + Q+  +  PAP   P PA  PA    PA       P
Subjt:  MPPKRGVRRGGRRGRGKGAGRNQP-TEGQAEQRVPTAPVTHVEFAALSAHMEQRFTELMTAIAQNQQAPAVPPAPVIPPVPAAPPAPTTPPAQGLAAQQP

Query:  QILPNQLSAEAKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTMCMLSGDVRQVTWDQFKDCFYTKFF
        Q +P+QLSAEAKHLRDFRKY+P TFDGSLEDPT+A++WLSS+ETIF YM+CPE+ +VQCA F+L DRG  WW TT  ML GDV Q+TW QFK+ FY KFF
Subjt:  QILPNQLSAEAKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTMCMLSGDVRQVTWDQFKDCFYTKFF

Query:  SANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSRFVPELVGNEQARADRFIKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIRARSFDKGSSSGQ
        SA+LRDAK QEFL L+QG MTVE+YD EFDMLSRF PE++  E ARAD+F++GLR +I+G VRA +P T A+ALRLAVD+S+ +    +++  +GS+SGQ
Subjt:  SANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSRFVPELVGNEQARADRFIKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIRARSFDKGSSSGQ

Query:  KRKAEQRTVGVPQRNLRSGDPFRSFQQSSGGAGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTG
        KRKAEQ+ V VPQRN R    FRSFQQ    AG+  R KPLC TCGK HLGRCL GTR C+KC+QEGH ADRCPLR TG
Subjt:  KRKAEQRTVGVPQRNLRSGDPFRSFQQSSGGAGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTG

A0A5A7T9Z4 Gag protease polyprotein2.61e-15360.57Show/hide
Query:  MPPKRGVRRGGRRGRGKGAGRNQP-TEGQAEQRVPTAPVTHVEFAALSAHMEQRFTELMTAIAQNQQAPAVPPAPVIPPVPAAPPAPTTPPAQGLAAQQP
        MPP+RG RRGGR GRG+GAGR QP  +  A+   P APVTH + AA    MEQRF +L+  + + Q+    P +P   P PA  PAP   PA       P
Subjt:  MPPKRGVRRGGRRGRGKGAGRNQP-TEGQAEQRVPTAPVTHVEFAALSAHMEQRFTELMTAIAQNQQAPAVPPAPVIPPVPAAPPAPTTPPAQGLAAQQP

Query:  QILPNQLSAEAKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTMCMLSGDVRQVTWDQFKDCFYTKFF
        Q +P+QLSAEAKHLRDFRKY+P TFDGSLEDPT+A++WLSS+ETIF YM+CPE+ +VQCA F+L DRG  WW TT  ML GDV Q+TW QFK+ FY KFF
Subjt:  QILPNQLSAEAKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTMCMLSGDVRQVTWDQFKDCFYTKFF

Query:  SANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSRFVPELVGNEQARADRFIKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIRARSFDKGSSSGQ
        SA+LRDAK QEFL L+QG MTVE+YD EFDMLSRF PE++  E ARAD+F++GLR +I+G VRA +P T A+ALRLAVD+++ +    +++  +G +SGQ
Subjt:  SANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSRFVPELVGNEQARADRFIKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIRARSFDKGSSSGQ

Query:  KRKAEQRTVGVPQRNLRSGDPFRSFQQSSGGAGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGQS
        KRKAEQ+ V VPQRN RSG  FR FQQ    AG+  REKPLC TCGK HLGRCL GTR C+KC+QEGH ADRCPLR  G  Q+
Subjt:  KRKAEQRTVGVPQRNLRSGDPFRSFQQSSGGAGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGQS

A0A5A7ULK8 Gag protease polyprotein1.28e-15761.62Show/hide
Query:  MPPKRGVRRGGRRGRGKGAGRNQP-TEGQAEQRVPTAPVTHVEFAALSAHMEQRFTELMTAIAQNQQAPAVPPAPVIPPVPAAPPAPTTPPAQGLAAQQP
        MPP+RG RRGGR GRG+GAGR QP  +  A+   P APVTHV+ AA    MEQRF +L+  + + QQ    PPAP + P PA  PAP   P        P
Subjt:  MPPKRGVRRGGRRGRGKGAGRNQP-TEGQAEQRVPTAPVTHVEFAALSAHMEQRFTELMTAIAQNQQAPAVPPAPVIPPVPAAPPAPTTPPAQGLAAQQP

Query:  QILPNQLSAEAKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTMCMLSGDVRQVTWDQFKDCFYTKFF
        Q++P+QLSAEAKHLRDFRKY+P TFDGSLEDPT+A+LWLSS+ETIF YM+CPE  +VQCA F+L DRG  WW TT  ML GDV Q+ W QFK+ FY KFF
Subjt:  QILPNQLSAEAKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTMCMLSGDVRQVTWDQFKDCFYTKFF

Query:  SANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSRFVPELVGNEQARADRFIKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIRARSFDKGSSSGQ
        SA+LRDAK QEFL L+QG MTVE+YD EFDML RF PE++  E ARAD+F++GLR +I+G VRA +P T A+ALRLAVD+S+ +    +++  +GS+SGQ
Subjt:  SANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSRFVPELVGNEQARADRFIKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIRARSFDKGSSSGQ

Query:  KRKAEQRTVGVPQRNLRSGDPFRSFQQSSGGAGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGQS
        KRKAEQ+ V VPQRN RSG  FR FQQ    AG+  R KPLC TCGK HLGRCL GTR C+KC+QEGH ADRCPLR TG  Q+
Subjt:  KRKAEQRTVGVPQRNLRSGDPFRSFQQSSGGAGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGQS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGCCAAAGAGAGGTGTACGTAGAGGAGGTCGTAGAGGTCGGGGTAAAGGAGCAGGTCGTAATCAACCTACTGAGGGTCAAGCTGAACAGCGAGTTCCTACTGCACC
TGTGACTCACGTTGAGTTTGCTGCACTGTCTGCTCACATGGAGCAGAGGTTCACGGAGCTTATGACGGCTATAGCTCAGAACCAGCAGGCACCTGCAGTTCCACCTGCAC
CTGTGATTCCCCCGGTACCAGCAGCTCCACCTGCACCAACAACCCCTCCTGCACAAGGATTGGCTGCACAACAGCCGCAGATATTACCGAACCAACTTTCTGCTGAGGCG
AAACATTTGAGGGACTTTAGGAAATATGACCCTCAAACGTTTGATGGGTCACTGGAGGATCCTACTAAAGCTGAGTTGTGGTTGTCCTCTGTGGAAACCATATTTAATTA
CATGAGATGTCCAGAGGAGCATAGAGTTCAGTGTGCTGCTTTTCTACTGAGGGACAGAGGCATTATCTGGTGGAGGACTACGATGTGTATGCTAAGTGGAGATGTGAGGC
AGGTTACCTGGGATCAGTTTAAAGACTGCTTCTATACCAAGTTTTTCTCGGCTAACCTTAGAGACGCCAAAAGCCAGGAATTCTTGGAGTTGAAGCAAGGACATATGACA
GTCGAGGAGTACGACCAGGAGTTTGATATGCTGTCGCGTTTTGTCCCTGAACTTGTTGGTAATGAGCAGGCTAGAGCTGATAGGTTCATCAAAGGATTGAGAGATGAGAT
TAGAGGCTTTGTGCGAGCACTAAAGCCCACTACCCAGGCTGAAGCGCTGCGTCTGGCAGTGGATATGAGCATTGGGAAGGATGAAATTCGGGCAAGGAGTTTTGATAAGG
GATCGTCATCTGGTCAAAAGAGGAAAGCAGAGCAGAGAACTGTGGGAGTTCCTCAGAGGAACTTGAGATCAGGCGATCCTTTTCGCAGTTTCCAGCAGAGTTCTGGCGGG
GCAGGAGACACTACTCGAGAGAAGCCACTATGCAATACGTGTGGGAAACGCCACCTGGGTCGTTGTTTGATGGGAACGAGAGTCTGTTATAAGTGCAAGCAAGAGGGACA
TATGGCTGATAGGTGTCCCTTGAGATCTACTGGGGCTGGACAGAGCAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCGCCAAAGAGAGGTGTACGTAGAGGAGGTCGTAGAGGTCGGGGTAAAGGAGCAGGTCGTAATCAACCTACTGAGGGTCAAGCTGAACAGCGAGTTCCTACTGCACC
TGTGACTCACGTTGAGTTTGCTGCACTGTCTGCTCACATGGAGCAGAGGTTCACGGAGCTTATGACGGCTATAGCTCAGAACCAGCAGGCACCTGCAGTTCCACCTGCAC
CTGTGATTCCCCCGGTACCAGCAGCTCCACCTGCACCAACAACCCCTCCTGCACAAGGATTGGCTGCACAACAGCCGCAGATATTACCGAACCAACTTTCTGCTGAGGCG
AAACATTTGAGGGACTTTAGGAAATATGACCCTCAAACGTTTGATGGGTCACTGGAGGATCCTACTAAAGCTGAGTTGTGGTTGTCCTCTGTGGAAACCATATTTAATTA
CATGAGATGTCCAGAGGAGCATAGAGTTCAGTGTGCTGCTTTTCTACTGAGGGACAGAGGCATTATCTGGTGGAGGACTACGATGTGTATGCTAAGTGGAGATGTGAGGC
AGGTTACCTGGGATCAGTTTAAAGACTGCTTCTATACCAAGTTTTTCTCGGCTAACCTTAGAGACGCCAAAAGCCAGGAATTCTTGGAGTTGAAGCAAGGACATATGACA
GTCGAGGAGTACGACCAGGAGTTTGATATGCTGTCGCGTTTTGTCCCTGAACTTGTTGGTAATGAGCAGGCTAGAGCTGATAGGTTCATCAAAGGATTGAGAGATGAGAT
TAGAGGCTTTGTGCGAGCACTAAAGCCCACTACCCAGGCTGAAGCGCTGCGTCTGGCAGTGGATATGAGCATTGGGAAGGATGAAATTCGGGCAAGGAGTTTTGATAAGG
GATCGTCATCTGGTCAAAAGAGGAAAGCAGAGCAGAGAACTGTGGGAGTTCCTCAGAGGAACTTGAGATCAGGCGATCCTTTTCGCAGTTTCCAGCAGAGTTCTGGCGGG
GCAGGAGACACTACTCGAGAGAAGCCACTATGCAATACGTGTGGGAAACGCCACCTGGGTCGTTGTTTGATGGGAACGAGAGTCTGTTATAAGTGCAAGCAAGAGGGACA
TATGGCTGATAGGTGTCCCTTGAGATCTACTGGGGCTGGACAGAGCAGTTAG
Protein sequenceShow/hide protein sequence
MPPKRGVRRGGRRGRGKGAGRNQPTEGQAEQRVPTAPVTHVEFAALSAHMEQRFTELMTAIAQNQQAPAVPPAPVIPPVPAAPPAPTTPPAQGLAAQQPQILPNQLSAEA
KHLRDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRTTMCMLSGDVRQVTWDQFKDCFYTKFFSANLRDAKSQEFLELKQGHMT
VEEYDQEFDMLSRFVPELVGNEQARADRFIKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIRARSFDKGSSSGQKRKAEQRTVGVPQRNLRSGDPFRSFQQSSGG
AGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGQSS