CuGenDBv2

Moc08g27090 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g27090
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Gag/pol protein
Genome location: chr8:19585060..19591486
RNA-Seq Expression: Moc08g27090
Synteny: Moc08g27090
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology
GenBank top hits | e value | %identity | Alignment
KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa] | 7.0e-21 | 57.84
Query:  KVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQTHPPQVLRVPRRSGRIVSQLNLYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKWVKAMDLE
        K+V+NE+S+E T  STRVV++    TRVV   S++R TH PQ LR PRRSGR+ +    Y+ LTET  VI D  +EDPLT+KKAMED DKD+W+KAM+LE
Subjt:  KVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQTHPPQVLRVPRRSGRIVSQLNLYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKWVKAMDLE

Query:  ME
        +E
Subjt:  ME

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa] | 7.0e-21 | 57.84
Query:  KVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQTHPPQVLRVPRRSGRIVSQLNLYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKWVKAMDLE
        K+V+NE+S+E T  STRVV++    TRVV   S++R TH PQ LR PRRSGR+ +    Y+ LTET  VI D  +EDPLT+KKAMED DKD+W+KAM+LE
Subjt:  KVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQTHPPQVLRVPRRSGRIVSQLNLYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKWVKAMDLE

Query:  ME
        +E
Subjt:  ME

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa] | 2.0e-129 | 71.05
Query:  MSASIIALLAAQKLNDENDKQWKSNLNTILVIDDLRFILQENCPQAPAPNATVVERNTYDKWIKANEKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS
        M+++ + +LAA KLN  N   WK+ +NT+L+IDDLRF+L E CPQ PA NAT   R  Y++W KANEKA+ YILAS+S+VLAKKHE  +TA+EIMDSLQ 
Subjt:  MSASIIALLAAQKLNDENDKQWKSNLNTILVIDDLRFILQENCPQAPAPNATVVERNTYDKWIKANEKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNSRMKEGSSVREHVLNMMVHFNVAESNEVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKSKG
        MFGQ S Q +H+ALK++YN+RM EG+SVREHVLNMMVHFNVAE N  VIDE SQVSFILESLP+SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFVYNSRMKEGSSVREHVLNMMVHFNVAESNEVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKSKG

Query:  QEGETNVATS-KRFNRGSSSGTKSAPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKGKVAEKGKCFHCNMDGHWKCNCPKYLAEKKKANEGKYDLLVLET
        Q+GE NVATS ++F+RGS+SGTKS PSSSG+K +KKK  G+G+K +   AAAK  K   A KG CFHCN +GHWK NCPKYLAEKKKA +GKYDLLVLET
Subjt:  QEGETNVATS-KRFNRGSSSGTKSAPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKGKVAEKGKCFHCNMDGHWKCNCPKYLAEKKKANEGKYDLLVLET

Query:  CLVENDDSAWILDSRATNHVCSSFQGISSWRQLDAGEMTLKV
        CLVENDDSAWI+DS ATNHVCSSFQGISSWRQL+ GEMT++V
Subjt:  CLVENDDSAWILDSRATNHVCSSFQGISSWRQLDAGEMTLKV

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa] | 7.0e-21 | 57.84
Query:  KVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQTHPPQVLRVPRRSGRIVSQLNLYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKWVKAMDLE
        K+V+NE+S+E T  STRVV++    TRVV   S++R TH PQ LR PRRSGR+ +    Y+ LTET  VI D  +EDPLT+KKAMED DKD+W+KAM+LE
Subjt:  KVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQTHPPQVLRVPRRSGRIVSQLNLYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKWVKAMDLE

Query:  ME
        +E
Subjt:  ME

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa] | 2.0e-129 | 71.05
Query:  MSASIIALLAAQKLNDENDKQWKSNLNTILVIDDLRFILQENCPQAPAPNATVVERNTYDKWIKANEKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS
        M+++ + +LAA KLN  N   WK+ +NT+L+IDDLRF+L E CPQ PA NAT   R  Y++W KANEKA+ YILAS+S+VLAKKHE  +TA+EIMDSLQ 
Subjt:  MSASIIALLAAQKLNDENDKQWKSNLNTILVIDDLRFILQENCPQAPAPNATVVERNTYDKWIKANEKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNSRMKEGSSVREHVLNMMVHFNVAESNEVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKSKG
        MFGQ S Q +H+ALK++YN+RM EG+SVREHVLNMMVHFNVAE N  VIDE SQVSFILESLP+SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFVYNSRMKEGSSVREHVLNMMVHFNVAESNEVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKSKG

Query:  QEGETNVATS-KRFNRGSSSGTKSAPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKGKVAEKGKCFHCNMDGHWKCNCPKYLAEKKKANEGKYDLLVLET
        Q+GE NVATS ++F+RGS+SGTKS PSSSG+K +KKK  G+G+K +   AAAK  K   A KG CFHCN +GHWK NCPKYLAEKKKA +GKYDLLVLET
Subjt:  QEGETNVATS-KRFNRGSSSGTKSAPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKGKVAEKGKCFHCNMDGHWKCNCPKYLAEKKKANEGKYDLLVLET

Query:  CLVENDDSAWILDSRATNHVCSSFQGISSWRQLDAGEMTLKV
        CLVENDDSAWI+DS ATNHVCSSFQGISSWRQL+ GEMT++V
Subjt:  CLVENDDSAWILDSRATNHVCSSFQGISSWRQLDAGEMTLKV

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa] | 7.0e-21 | 57.84
Query:  KVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQTHPPQVLRVPRRSGRIVSQLNLYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKWVKAMDLE
        K+V+NE+S+E T  STRVV++    TRVV   S++R TH PQ LR PRRSGR+ +    Y+ LTET  VI D  +EDPLT+KKAMED DKD+W+KAM+LE
Subjt:  KVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQTHPPQVLRVPRRSGRIVSQLNLYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKWVKAMDLE

Query:  ME
        +E
Subjt:  ME

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa] | 2.0e-129 | 71.05
Query:  MSASIIALLAAQKLNDENDKQWKSNLNTILVIDDLRFILQENCPQAPAPNATVVERNTYDKWIKANEKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS
        M+++ + +LAA KLN  N   WK+ +NT+L+IDDLRF+L E CPQ PA NAT   R  Y++W KANEKA+ YILAS+S+VLAKKHE  +TA+EIMDSLQ 
Subjt:  MSASIIALLAAQKLNDENDKQWKSNLNTILVIDDLRFILQENCPQAPAPNATVVERNTYDKWIKANEKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNSRMKEGSSVREHVLNMMVHFNVAESNEVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKSKG
        MFGQ S Q +H+ALK++YN+RM EG+SVREHVLNMMVHFNVAE N  VIDE SQVSFILESLP+SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFVYNSRMKEGSSVREHVLNMMVHFNVAESNEVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKSKG

Query:  QEGETNVATS-KRFNRGSSSGTKSAPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKGKVAEKGKCFHCNMDGHWKCNCPKYLAEKKKANEGKYDLLVLET
        Q+GE NVATS ++F+RGS+SGTKS PSSSG+K +KKK  G+G+K +   AAAK  K   A KG CFHCN +GHWK NCPKYLAEKKKA +GKYDLLVLET
Subjt:  QEGETNVATS-KRFNRGSSSGTKSAPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKGKVAEKGKCFHCNMDGHWKCNCPKYLAEKKKANEGKYDLLVLET

Query:  CLVENDDSAWILDSRATNHVCSSFQGISSWRQLDAGEMTLKV
        CLVENDDSAWI+DS ATNHVCSSFQGISSWRQL+ GEMT++V
Subjt:  CLVENDDSAWILDSRATNHVCSSFQGISSWRQLDAGEMTLKV

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa] | 2.0e-129 | 71.05
Query:  MSASIIALLAAQKLNDENDKQWKSNLNTILVIDDLRFILQENCPQAPAPNATVVERNTYDKWIKANEKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS
        M+++ + +LAA KLN  N   WK+ +NT+L+IDDLRF+L E CPQ PA NAT   R  Y++W KANEKA+ YILAS+S+VLAKKHE  +TA+EIMDSLQ 
Subjt:  MSASIIALLAAQKLNDENDKQWKSNLNTILVIDDLRFILQENCPQAPAPNATVVERNTYDKWIKANEKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNSRMKEGSSVREHVLNMMVHFNVAESNEVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKSKG
        MFGQ S Q +H+ALK++YN+RM EG+SVREHVLNMMVHFNVAE N  VIDE SQVSFILESLP+SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFVYNSRMKEGSSVREHVLNMMVHFNVAESNEVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKSKG

Query:  QEGETNVATS-KRFNRGSSSGTKSAPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKGKVAEKGKCFHCNMDGHWKCNCPKYLAEKKKANEGKYDLLVLET
        Q+GE NVATS ++F+RGS+SGTKS PSSSG+K +KKK  G+G+K +   AAAK  K   A KG CFHCN +GHWK NCPKYLAEKKKA +GKYDLLVLET
Subjt:  QEGETNVATS-KRFNRGSSSGTKSAPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKGKVAEKGKCFHCNMDGHWKCNCPKYLAEKKKANEGKYDLLVLET

Query:  CLVENDDSAWILDSRATNHVCSSFQGISSWRQLDAGEMTLKV
        CLVENDDSAWI+DS ATNHVCSSFQGISSWRQL+ GEMT++V
Subjt:  CLVENDDSAWILDSRATNHVCSSFQGISSWRQLDAGEMTLKV

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa] | 7.0e-21 | 57.84
Query:  KVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQTHPPQVLRVPRRSGRIVSQLNLYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKWVKAMDLE
        K+V+NE+S+E T  STRVV++    TRVV   S++R TH PQ LR PRRSGR+ +    Y+ LTET  VI D  +EDPLT+KKAMED DKD+W+KAM+LE
Subjt:  KVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQTHPPQVLRVPRRSGRIVSQLNLYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKWVKAMDLE

Query:  ME
        +E
Subjt:  ME

TrEMBL top hits | e value | %identity | Alignment
A0A5A7SMH8 Gag/pol protein | 9.8e-130 | 71.05
Query:  MSASIIALLAAQKLNDENDKQWKSNLNTILVIDDLRFILQENCPQAPAPNATVVERNTYDKWIKANEKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS
        M+++ + +LAA KLN  N   WK+ +NT+L+IDDLRF+L E CPQ PA NAT   R  Y++W KANEKA+ YILAS+S+VLAKKHE  +TA+EIMDSLQ 
Subjt:  MSASIIALLAAQKLNDENDKQWKSNLNTILVIDDLRFILQENCPQAPAPNATVVERNTYDKWIKANEKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNSRMKEGSSVREHVLNMMVHFNVAESNEVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKSKG
        MFGQ S Q +H+ALK++YN+RM EG+SVREHVLNMMVHFNVAE N  VIDE SQVSFILESLP+SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFVYNSRMKEGSSVREHVLNMMVHFNVAESNEVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKSKG

Query:  QEGETNVATS-KRFNRGSSSGTKSAPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKGKVAEKGKCFHCNMDGHWKCNCPKYLAEKKKANEGKYDLLVLET
        Q+GE NVATS ++F+RGS+SGTKS PSSSG+K +KKK  G+G+K +   AAAK  K   A KG CFHCN +GHWK NCPKYLAEKKKA +GKYDLLVLET
Subjt:  QEGETNVATS-KRFNRGSSSGTKSAPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKGKVAEKGKCFHCNMDGHWKCNCPKYLAEKKKANEGKYDLLVLET

Query:  CLVENDDSAWILDSRATNHVCSSFQGISSWRQLDAGEMTLKV
        CLVENDDSAWI+DS ATNHVCSSFQGISSWRQL+ GEMT++V
Subjt:  CLVENDDSAWILDSRATNHVCSSFQGISSWRQLDAGEMTLKV

A0A5A7SMH8 Gag/pol protein | 3.4e-21 | 57.84
Query:  KVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQTHPPQVLRVPRRSGRIVSQLNLYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKWVKAMDLE
        K+V+NE+S+E T  STRVV++    TRVV   S++R TH PQ LR PRRSGR+ +    Y+ LTET  VI D  +EDPLT+KKAMED DKD+W+KAM+LE
Subjt:  KVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQTHPPQVLRVPRRSGRIVSQLNLYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKWVKAMDLE

Query:  ME
        +E
Subjt:  ME

A0A5A7TWB9 Gag/pol protein | 3.4e-21 | 57.84
Query:  KVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQTHPPQVLRVPRRSGRIVSQLNLYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKWVKAMDLE
        K+V+NE+S+E T  STRVV++    TRVV   S++R TH PQ LR PRRSGR+ +    Y+ LTET  VI D  +EDPLT+KKAMED DKD+W+KAM+LE
Subjt:  KVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQTHPPQVLRVPRRSGRIVSQLNLYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKWVKAMDLE

Query:  ME
        +E
Subjt:  ME

A0A5A7TWB9 Gag/pol protein | 9.8e-130 | 71.05
Query:  MSASIIALLAAQKLNDENDKQWKSNLNTILVIDDLRFILQENCPQAPAPNATVVERNTYDKWIKANEKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS
        M+++ + +LAA KLN  N   WK+ +NT+L+IDDLRF+L E CPQ PA NAT   R  Y++W KANEKA+ YILAS+S+VLAKKHE  +TA+EIMDSLQ 
Subjt:  MSASIIALLAAQKLNDENDKQWKSNLNTILVIDDLRFILQENCPQAPAPNATVVERNTYDKWIKANEKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNSRMKEGSSVREHVLNMMVHFNVAESNEVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKSKG
        MFGQ S Q +H+ALK++YN+RM EG+SVREHVLNMMVHFNVAE N  VIDE SQVSFILESLP+SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFVYNSRMKEGSSVREHVLNMMVHFNVAESNEVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKSKG

Query:  QEGETNVATS-KRFNRGSSSGTKSAPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKGKVAEKGKCFHCNMDGHWKCNCPKYLAEKKKANEGKYDLLVLET
        Q+GE NVATS ++F+RGS+SGTKS PSSSG+K +KKK  G+G+K +   AAAK  K   A KG CFHCN +GHWK NCPKYLAEKKKA +GKYDLLVLET
Subjt:  QEGETNVATS-KRFNRGSSSGTKSAPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKGKVAEKGKCFHCNMDGHWKCNCPKYLAEKKKANEGKYDLLVLET

Query:  CLVENDDSAWILDSRATNHVCSSFQGISSWRQLDAGEMTLKV
        CLVENDDSAWI+DS ATNHVCSSFQGISSWRQL+ GEMT++V
Subjt:  CLVENDDSAWILDSRATNHVCSSFQGISSWRQLDAGEMTLKV

A0A5A7UGV2 Gag/pol protein | 3.4e-21 | 57.84
Query:  KVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQTHPPQVLRVPRRSGRIVSQLNLYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKWVKAMDLE
        K+V+NE+S+E T  STRVV++    TRVV   S++R TH PQ LR PRRSGR+ +    Y+ LTET  VI D  +EDPLT+KKAMED DKD+W+KAM+LE
Subjt:  KVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQTHPPQVLRVPRRSGRIVSQLNLYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKWVKAMDLE

Query:  ME
        +E
Subjt:  ME

A0A5D3CPJ6 Gag/pol protein | 3.4e-21 | 57.84
Query:  KVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQTHPPQVLRVPRRSGRIVSQLNLYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKWVKAMDLE
        K+V+NE+S+E T  STRVV++    TRVV   S++R TH PQ LR PRRSGR+ +    Y+ LTET  VI D  +EDPLT+KKAMED DKD+W+KAM+LE
Subjt:  KVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQTHPPQVLRVPRRSGRIVSQLNLYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKWVKAMDLE

Query:  ME
        +E
Subjt:  ME

A0A5D3CPJ6 Gag/pol protein | 9.8e-130 | 71.05
Query:  MSASIIALLAAQKLNDENDKQWKSNLNTILVIDDLRFILQENCPQAPAPNATVVERNTYDKWIKANEKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS
        M+++ + +LAA KLN  N   WK+ +NT+L+IDDLRF+L E CPQ PA NAT   R  Y++W KANEKA+ YILAS+S+VLAKKHE  +TA+EIMDSLQ 
Subjt:  MSASIIALLAAQKLNDENDKQWKSNLNTILVIDDLRFILQENCPQAPAPNATVVERNTYDKWIKANEKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNSRMKEGSSVREHVLNMMVHFNVAESNEVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKSKG
        MFGQ S Q +H+ALK++YN+RM EG+SVREHVLNMMVHFNVAE N  VIDE SQVSFILESLP+SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFVYNSRMKEGSSVREHVLNMMVHFNVAESNEVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKSKG

Query:  QEGETNVATS-KRFNRGSSSGTKSAPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKGKVAEKGKCFHCNMDGHWKCNCPKYLAEKKKANEGKYDLLVLET
        Q+GE NVATS ++F+RGS+SGTKS PSSSG+K +KKK  G+G+K +   AAAK  K   A KG CFHCN +GHWK NCPKYLAEKKKA +GKYDLLVLET
Subjt:  QEGETNVATS-KRFNRGSSSGTKSAPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKGKVAEKGKCFHCNMDGHWKCNCPKYLAEKKKANEGKYDLLVLET

Query:  CLVENDDSAWILDSRATNHVCSSFQGISSWRQLDAGEMTLKV
        CLVENDDSAWI+DS ATNHVCSSFQGISSWRQL+ GEMT++V
Subjt:  CLVENDDSAWILDSRATNHVCSSFQGISSWRQLDAGEMTLKV

A0A5D3CSZ6 Gag/pol protein | 3.4e-21 | 57.84
Query:  KVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQTHPPQVLRVPRRSGRIVSQLNLYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKWVKAMDLE
        K+V+NE+S+E T  STRVV++    TRVV   S++R TH PQ LR PRRSGR+ +    Y+ LTET  VI D  +EDPLT+KKAMED DKD+W+KAM+LE
Subjt:  KVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQTHPPQVLRVPRRSGRIVSQLNLYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKWVKAMDLE

Query:  ME
        +E
Subjt:  ME

A0A5D3CSZ6 Gag/pol protein | 9.8e-130 | 71.05
Query:  MSASIIALLAAQKLNDENDKQWKSNLNTILVIDDLRFILQENCPQAPAPNATVVERNTYDKWIKANEKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS
        M+++ + +LAA KLN  N   WK+ +NT+L+IDDLRF+L E CPQ PA NAT   R  Y++W KANEKA+ YILAS+S+VLAKKHE  +TA+EIMDSLQ 
Subjt:  MSASIIALLAAQKLNDENDKQWKSNLNTILVIDDLRFILQENCPQAPAPNATVVERNTYDKWIKANEKAKVYILASISDVLAKKHEDTVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFVYNSRMKEGSSVREHVLNMMVHFNVAESNEVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKSKG
        MFGQ S Q +H+ALK++YN+RM EG+SVREHVLNMMVHFNVAE N  VIDE SQVSFILESLP+SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFVYNSRMKEGSSVREHVLNMMVHFNVAESNEVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKSKG

Query:  QEGETNVATS-KRFNRGSSSGTKSAPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKGKVAEKGKCFHCNMDGHWKCNCPKYLAEKKKANEGKYDLLVLET
        Q+GE NVATS ++F+RGS+SGTKS PSSSG+K +KKK  G+G+K +   AAAK  K   A KG CFHCN +GHWK NCPKYLAEKKKA +GKYDLLVLET
Subjt:  QEGETNVATS-KRFNRGSSSGTKSAPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKGKVAEKGKCFHCNMDGHWKCNCPKYLAEKKKANEGKYDLLVLET

Query:  CLVENDDSAWILDSRATNHVCSSFQGISSWRQLDAGEMTLKV
        CLVENDDSAWI+DS ATNHVCSSFQGISSWRQL+ GEMT++V
Subjt:  CLVENDDSAWILDSRATNHVCSSFQGISSWRQLDAGEMTLKV

SwissProt top hits | e value | %identity | Alignment
No hits found
Arabidopsis top hits | e value | %identity | Alignment
No hits found
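
The %identity figures reported for the hits above appear to follow from the alignment midline. The sketch below assumes the usual BLAST-style convention (a letter on the midline marks an identical residue, "+" a conservative substitution, a space a mismatch or gap) and assumes identity is computed as identical positions divided by alignment length. The function name percent_identity is hypothetical and not part of CuGenDB; the alignment length is passed in explicitly because trailing spaces on the midline may be lost in plain text.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Return identical positions / alignment length, as a percentage."""
    # Assumption: only letters on the midline count as identities;
    # '+' (similar residue) and ' ' (mismatch or gap) do not.
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / aligned_length


# Midline rows of the 102-column alignment listed with 57.84 %identity
# (100 columns in the first block, 2 in the second).
short_midline = (
    "K+V+NE+S+E T  STRVV++    TRVV   S++R TH PQ LR PRRSGR+ +    Y+ "
    "LTET  VI D  +EDPLT+KKAMED DKD+W+KAM+LE"
    "+E"
)

if __name__ == "__main__":
    print(round(percent_identity(short_midline, 102), 2))
```

Under these assumptions the short alignment has 59 identical positions out of 102, which rounds to the listed 57.84.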

Sequences
CDS sequence
ATGCAGCAGCGCCATGGCGCTGTAGGGACAGCACATAGCGCCGCGGCACTGTGCAGCGCCATGGCGCCATGCCTGGGCGCTGCGGCGCTGCCCTTAGGCACCAAGGCGCT
GTCCCAGGTGTTTTTCGGCGCGTTTCCGTGGTTCTGCATGTCTGCTTCCATTATTGCACTCCTAGCCGCTCAAAAACTTAACGACGAGAATGACAAACAATGGAAATCGA
ATCTAAACACTATTCTCGTGATAGATGATCTTAGATTCATCTTGCAAGAGAATTGTCCTCAAGCTCCTGCGCCTAACGCCACTGTGGTGGAGCGCAACACCTATGACAAG
TGGATCAAGGCCAATGAAAAGGCCAAGGTCTACATCTTGGCGAGCATATCTGATGTGCTTGCCAAGAAGCACGAGGACACGGTCACCGCTAAGGAGATCATGGACTCGCT
GCAGAGCATGTTTGGACAACCGTCCTCACAGGCTCGACATGAAGCCCTTAAGTTCGTTTACAACTCCCGCATGAAGGAGGGCTCCTCAGTGCGAGAACACGTTCTCAACA
TGATGGTCCACTTCAACGTGGCTGAGTCGAATGAGGTCGTCATTGACGAGCAGAGTCAGGTCAGCTTCATTCTAGAATCTCTTCCGAAGAGTTTCCTGCCATTTCGCAGC
AATGCGGTTATGAATAAGTTGGAGTACACTCTTACCACGCTCTTAAACGAGCTGCAGACCTACCAGTCTCTTATGAAGAGTAAGGGACAAGAAGGGGAGACAAATGTTGC
CACCTCAAAGAGGTTCAATCGAGGTTCATCCTCTGGAACCAAGTCTGCGCCCTCTTCTTCTGGAAGTAAGACTTTTAAGAAGAAGGCTGCTGGTAAGGGGTCTAAACCTG
ACTCCGCTGCTGCTGCTGCCAAAAAAGGCAAGGGCAAGGTTGCAGAGAAAGGAAAGTGTTTCCACTGCAATATGGACGGGCATTGGAAGTGCAACTGCCCAAAGTATTTG
GCCGAAAAGAAGAAAGCCAACGAAGGTAAATATGATTTACTTGTTTTGGAAACATGTTTAGTGGAGAACGATGACTCTGCCTGGATACTGGATTCAAGAGCCACTAATCA
CGTTTGTTCTTCATTTCAGGGAATTAGTTCCTGGAGGCAGCTTGACGCTGGAGAGATGACTCTCAAGGTAGTGATTAACGAGATTTCCGAAGAGGCTACAAACACGTCAA
CAAGGGTTGTTGATCAAACTGGCACTACAACAAGAGTTGTTGATGAAGCCAGTACATCACGTCAGACACATCCACCTCAAGTGTTGAGGGTGCCTCGACGTAGTGGGAGG
ATTGTGTCACAACTTAACCTATATGTGGGTTTAACTGAAACCCAAGTTGTCATACCTGATGACGGGGTCGAGGATCCATTGACCTACAAAAAGGCAATGGAAGATACTGA
CAAGGATAAATGGGTCAAAGCAATGGACCTGGAAATGGAGTTGATACGACTTGATCTAGGTCGTTCGTGTGGAGACATGCGAGGTGGGGTATCCTATACAATGAGTTTGT
ATAAGACCGGACCACGAAATAGCCAATCTTTAGATGTAACACTGTTGACTAATAGATTGTTGTTTCTTAGGATGACCAGACAACTCATTCTCAATCCTGAGTGA
mRNA sequence
ATGCAGCAGCGCCATGGCGCTGTAGGGACAGCACATAGCGCCGCGGCACTGTGCAGCGCCATGGCGCCATGCCTGGGCGCTGCGGCGCTGCCCTTAGGCACCAAGGCGCT
GTCCCAGGTGTTTTTCGGCGCGTTTCCGTGGTTCTGCATGTCTGCTTCCATTATTGCACTCCTAGCCGCTCAAAAACTTAACGACGAGAATGACAAACAATGGAAATCGA
ATCTAAACACTATTCTCGTGATAGATGATCTTAGATTCATCTTGCAAGAGAATTGTCCTCAAGCTCCTGCGCCTAACGCCACTGTGGTGGAGCGCAACACCTATGACAAG
TGGATCAAGGCCAATGAAAAGGCCAAGGTCTACATCTTGGCGAGCATATCTGATGTGCTTGCCAAGAAGCACGAGGACACGGTCACCGCTAAGGAGATCATGGACTCGCT
GCAGAGCATGTTTGGACAACCGTCCTCACAGGCTCGACATGAAGCCCTTAAGTTCGTTTACAACTCCCGCATGAAGGAGGGCTCCTCAGTGCGAGAACACGTTCTCAACA
TGATGGTCCACTTCAACGTGGCTGAGTCGAATGAGGTCGTCATTGACGAGCAGAGTCAGGTCAGCTTCATTCTAGAATCTCTTCCGAAGAGTTTCCTGCCATTTCGCAGC
AATGCGGTTATGAATAAGTTGGAGTACACTCTTACCACGCTCTTAAACGAGCTGCAGACCTACCAGTCTCTTATGAAGAGTAAGGGACAAGAAGGGGAGACAAATGTTGC
CACCTCAAAGAGGTTCAATCGAGGTTCATCCTCTGGAACCAAGTCTGCGCCCTCTTCTTCTGGAAGTAAGACTTTTAAGAAGAAGGCTGCTGGTAAGGGGTCTAAACCTG
ACTCCGCTGCTGCTGCTGCCAAAAAAGGCAAGGGCAAGGTTGCAGAGAAAGGAAAGTGTTTCCACTGCAATATGGACGGGCATTGGAAGTGCAACTGCCCAAAGTATTTG
GCCGAAAAGAAGAAAGCCAACGAAGGTAAATATGATTTACTTGTTTTGGAAACATGTTTAGTGGAGAACGATGACTCTGCCTGGATACTGGATTCAAGAGCCACTAATCA
CGTTTGTTCTTCATTTCAGGGAATTAGTTCCTGGAGGCAGCTTGACGCTGGAGAGATGACTCTCAAGGTAGTGATTAACGAGATTTCCGAAGAGGCTACAAACACGTCAA
CAAGGGTTGTTGATCAAACTGGCACTACAACAAGAGTTGTTGATGAAGCCAGTACATCACGTCAGACACATCCACCTCAAGTGTTGAGGGTGCCTCGACGTAGTGGGAGG
ATTGTGTCACAACTTAACCTATATGTGGGTTTAACTGAAACCCAAGTTGTCATACCTGATGACGGGGTCGAGGATCCATTGACCTACAAAAAGGCAATGGAAGATACTGA
CAAGGATAAATGGGTCAAAGCAATGGACCTGGAAATGGAGTTGATACGACTTGATCTAGGTCGTTCGTGTGGAGACATGCGAGGTGGGGTATCCTATACAATGAGTTTGT
ATAAGACCGGACCACGAAATAGCCAATCTTTAGATGTAACACTGTTGACTAATAGATTGTTGTTTCTTAGGATGACCAGACAACTCATTCTCAATCCTGAGTGA
Protein sequence
MQQRHGAVGTAHSAAALCSAMAPCLGAAALPLGTKALSQVFFGAFPWFCMSASIIALLAAQKLNDENDKQWKSNLNTILVIDDLRFILQENCPQAPAPNATVVERNTYDK
WIKANEKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFVYNSRMKEGSSVREHVLNMMVHFNVAESNEVVIDEQSQVSFILESLPKSFLPFRS
NAVMNKLEYTLTTLLNELQTYQSLMKSKGQEGETNVATSKRFNRGSSSGTKSAPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKGKVAEKGKCFHCNMDGHWKCNCPKYL
AEKKKANEGKYDLLVLETCLVENDDSAWILDSRATNHVCSSFQGISSWRQLDAGEMTLKVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQTHPPQVLRVPRRSGR
IVSQLNLYVGLTETQVVIPDDGVEDPLTYKKAMEDTDKDKWVKAMDLEMELIRLDLGRSCGDMRGGVSYTMSLYKTGPRNSQSLDVTLLTNRLLFLRMTRQLILNPE
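
The protein sequence above can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code and a CDS that starts in frame at its first base; the names CODON_TABLE and translate are hypothetical helpers, not part of CuGenDB.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted as-is.
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Codons containing anything other than A, C, G, T become 'X'.
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 36 bases of the CDS listed above.
    print(translate("ATGCAGCAGCGCCATGGCGCTGTAGGGACAGCACAT"))
```

Pasting the full CDS into translate and comparing the result with the listed protein sequence is a quick consistency check on the record.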