CuGenDBv2

Moc02g25580 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc02g25580
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Gag/pol protein
Genome location: chr2:18246172..18252812
RNA-Seq Expression: Moc02g25580
Synteny: Moc02g25580
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily
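
The CCHC-type zinc finger annotated above (IPR001878) is commonly described by the residue spacing C-X2-C-X4-H-X4-C. As a rough illustration only, the sketch below scans a short fragment of this gene's protein sequence for that spacing. The regular expression and the function name find_cchc are assumptions made for this example; InterPro itself assigns domains with curated signatures, not with a simple pattern like this one.

```python
import re

# Fragment copied from the protein sequence of Moc02g25580 listed on this page.
FRAGMENT = "AAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYLAEKKKANEDKYD"

# Assumed textbook spacing for a CCHC zinc knuckle: C-X2-C-X4-H-X4-C.
CCHC_PATTERN = re.compile(r"C.{2}C.{4}H.{4}C")


def find_cchc(protein):
    """Return (0-based start, matched text) for each non-overlapping match."""
    return [(m.start(), m.group()) for m in CCHC_PATTERN.finditer(protein)]


for start, motif in find_cchc(FRAGMENT):
    print(start, motif)
```

A pattern match of this kind is only a hint that the annotated domain is present; it says nothing about whether the finger is functional.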


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]; e value 2.2e-133; identity 70.45%
Query:  MFTSIIALLATLRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHENTVTAKEIMNSLQS
        M ++ + +LA  +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ P  NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE+ +TA+EIM+SLQ 
Subjt:  MFTSIIALLATLRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHENTVTAKEIMNSLQS

Query:  MFGQLSSQARHEALKFIYNSPIKEGSSVRENVLNLMVHFNVAESNGAVIDEQSQVSFILKSLPKSFLPFRSNAVMNKLEYTLTTLLKELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+ + EG+SVRE+VLN+MVHFNVAE NGAVIDE SQVSFIL+SLP+SFL FRSNAVMNK+ YTLTTLL ELQT++SLMK KG
Subjt:  MFGQLSSQARHEALKFIYNSPIKEGSSVRENVLNLMVHFNVAESNGAVIDEQSQVSFILKSLPKSFLPFRSNAVMNKLEYTLTTLLKELQTYQSLMKCKG

Query:  QEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYLAEKKKANEDKYDLLVLET
        Q+GEANVATS ++F+RGS+SGT+S PSSSG+K +KKKK  G+G+K + AAA    K K A KG CFHCN +GHWKRNCPKYLAEKKKA + KYDLLVLET
Subjt:  QEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYLAEKKKANEDKYDLLVLET

Query:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDAREMTLKVGTGEVFSAVA
        CLVENDDSAWI+DSGATNHVCSSFQGISSWRQL+  EMT++VGTG V SA+A
Subjt:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDAREMTLKVGTGEVFSAVA

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]; e value 2.2e-133; identity 70.45%
Query:  MFTSIIALLATLRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHENTVTAKEIMNSLQS
        M ++ + +LA  +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ P  NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE+ +TA+EIM+SLQ 
Subjt:  MFTSIIALLATLRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHENTVTAKEIMNSLQS

Query:  MFGQLSSQARHEALKFIYNSPIKEGSSVRENVLNLMVHFNVAESNGAVIDEQSQVSFILKSLPKSFLPFRSNAVMNKLEYTLTTLLKELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+ + EG+SVRE+VLN+MVHFNVAE NGAVIDE SQVSFIL+SLP+SFL FRSNAVMNK+ YTLTTLL ELQT++SLMK KG
Subjt:  MFGQLSSQARHEALKFIYNSPIKEGSSVRENVLNLMVHFNVAESNGAVIDEQSQVSFILKSLPKSFLPFRSNAVMNKLEYTLTTLLKELQTYQSLMKCKG

Query:  QEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYLAEKKKANEDKYDLLVLET
        Q+GEANVATS ++F+RGS+SGT+S PSSSG+K +KKKK  G+G+K + AAA    K K A KG CFHCN +GHWKRNCPKYLAEKKKA + KYDLLVLET
Subjt:  QEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYLAEKKKANEDKYDLLVLET

Query:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDAREMTLKVGTGEVFSAVA
        CLVENDDSAWI+DSGATNHVCSSFQGISSWRQL+  EMT++VGTG V SA+A
Subjt:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDAREMTLKVGTGEVFSAVA

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]; e value 2.2e-133; identity 70.45%
Query:  MFTSIIALLATLRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHENTVTAKEIMNSLQS
        M ++ + +LA  +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ P  NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE+ +TA+EIM+SLQ 
Subjt:  MFTSIIALLATLRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHENTVTAKEIMNSLQS

Query:  MFGQLSSQARHEALKFIYNSPIKEGSSVRENVLNLMVHFNVAESNGAVIDEQSQVSFILKSLPKSFLPFRSNAVMNKLEYTLTTLLKELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+ + EG+SVRE+VLN+MVHFNVAE NGAVIDE SQVSFIL+SLP+SFL FRSNAVMNK+ YTLTTLL ELQT++SLMK KG
Subjt:  MFGQLSSQARHEALKFIYNSPIKEGSSVRENVLNLMVHFNVAESNGAVIDEQSQVSFILKSLPKSFLPFRSNAVMNKLEYTLTTLLKELQTYQSLMKCKG

Query:  QEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYLAEKKKANEDKYDLLVLET
        Q+GEANVATS ++F+RGS+SGT+S PSSSG+K +KKKK  G+G+K + AAA    K K A KG CFHCN +GHWKRNCPKYLAEKKKA + KYDLLVLET
Subjt:  QEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYLAEKKKANEDKYDLLVLET

Query:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDAREMTLKVGTGEVFSAVA
        CLVENDDSAWI+DSGATNHVCSSFQGISSWRQL+  EMT++VGTG V SA+A
Subjt:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDAREMTLKVGTGEVFSAVA

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]; e value 2.2e-133; identity 70.45%
Query:  MFTSIIALLATLRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHENTVTAKEIMNSLQS
        M ++ + +LA  +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ P  NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE+ +TA+EIM+SLQ 
Subjt:  MFTSIIALLATLRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHENTVTAKEIMNSLQS

Query:  MFGQLSSQARHEALKFIYNSPIKEGSSVRENVLNLMVHFNVAESNGAVIDEQSQVSFILKSLPKSFLPFRSNAVMNKLEYTLTTLLKELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+ + EG+SVRE+VLN+MVHFNVAE NGAVIDE SQVSFIL+SLP+SFL FRSNAVMNK+ YTLTTLL ELQT++SLMK KG
Subjt:  MFGQLSSQARHEALKFIYNSPIKEGSSVRENVLNLMVHFNVAESNGAVIDEQSQVSFILKSLPKSFLPFRSNAVMNKLEYTLTTLLKELQTYQSLMKCKG

Query:  QEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYLAEKKKANEDKYDLLVLET
        Q+GEANVATS ++F+RGS+SGT+S PSSSG+K +KKKK  G+G+K + AAA    K K A KG CFHCN +GHWKRNCPKYLAEKKKA + KYDLLVLET
Subjt:  QEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYLAEKKKANEDKYDLLVLET

Query:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDAREMTLKVGTGEVFSAVA
        CLVENDDSAWI+DSGATNHVCSSFQGISSWRQL+  EMT++VGTG V SA+A
Subjt:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDAREMTLKVGTGEVFSAVA

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]; e value 2.2e-133; identity 70.45%
Query:  MFTSIIALLATLRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHENTVTAKEIMNSLQS
        M ++ + +LA  +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ P  NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE+ +TA+EIM+SLQ 
Subjt:  MFTSIIALLATLRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHENTVTAKEIMNSLQS

Query:  MFGQLSSQARHEALKFIYNSPIKEGSSVRENVLNLMVHFNVAESNGAVIDEQSQVSFILKSLPKSFLPFRSNAVMNKLEYTLTTLLKELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+ + EG+SVRE+VLN+MVHFNVAE NGAVIDE SQVSFIL+SLP+SFL FRSNAVMNK+ YTLTTLL ELQT++SLMK KG
Subjt:  MFGQLSSQARHEALKFIYNSPIKEGSSVRENVLNLMVHFNVAESNGAVIDEQSQVSFILKSLPKSFLPFRSNAVMNKLEYTLTTLLKELQTYQSLMKCKG

Query:  QEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYLAEKKKANEDKYDLLVLET
        Q+GEANVATS ++F+RGS+SGT+S PSSSG+K +KKKK  G+G+K + AAA    K K A KG CFHCN +GHWKRNCPKYLAEKKKA + KYDLLVLET
Subjt:  QEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYLAEKKKANEDKYDLLVLET

Query:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDAREMTLKVGTGEVFSAVA
        CLVENDDSAWI+DSGATNHVCSSFQGISSWRQL+  EMT++VGTG V SA+A
Subjt:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDAREMTLKVGTGEVFSAVA

TrEMBL top hits (columns: e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein; e value 1.1e-133; identity 70.45%
Query:  MFTSIIALLATLRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHENTVTAKEIMNSLQS
        M ++ + +LA  +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ P  NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE+ +TA+EIM+SLQ 
Subjt:  MFTSIIALLATLRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHENTVTAKEIMNSLQS

Query:  MFGQLSSQARHEALKFIYNSPIKEGSSVRENVLNLMVHFNVAESNGAVIDEQSQVSFILKSLPKSFLPFRSNAVMNKLEYTLTTLLKELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+ + EG+SVRE+VLN+MVHFNVAE NGAVIDE SQVSFIL+SLP+SFL FRSNAVMNK+ YTLTTLL ELQT++SLMK KG
Subjt:  MFGQLSSQARHEALKFIYNSPIKEGSSVRENVLNLMVHFNVAESNGAVIDEQSQVSFILKSLPKSFLPFRSNAVMNKLEYTLTTLLKELQTYQSLMKCKG

Query:  QEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYLAEKKKANEDKYDLLVLET
        Q+GEANVATS ++F+RGS+SGT+S PSSSG+K +KKKK  G+G+K + AAA    K K A KG CFHCN +GHWKRNCPKYLAEKKKA + KYDLLVLET
Subjt:  QEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYLAEKKKANEDKYDLLVLET

Query:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDAREMTLKVGTGEVFSAVA
        CLVENDDSAWI+DSGATNHVCSSFQGISSWRQL+  EMT++VGTG V SA+A
Subjt:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDAREMTLKVGTGEVFSAVA

A0A5A7TWB9 Gag/pol protein; e value 1.1e-133; identity 70.45%
Query:  MFTSIIALLATLRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHENTVTAKEIMNSLQS
        M ++ + +LA  +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ P  NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE+ +TA+EIM+SLQ 
Subjt:  MFTSIIALLATLRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHENTVTAKEIMNSLQS

Query:  MFGQLSSQARHEALKFIYNSPIKEGSSVRENVLNLMVHFNVAESNGAVIDEQSQVSFILKSLPKSFLPFRSNAVMNKLEYTLTTLLKELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+ + EG+SVRE+VLN+MVHFNVAE NGAVIDE SQVSFIL+SLP+SFL FRSNAVMNK+ YTLTTLL ELQT++SLMK KG
Subjt:  MFGQLSSQARHEALKFIYNSPIKEGSSVRENVLNLMVHFNVAESNGAVIDEQSQVSFILKSLPKSFLPFRSNAVMNKLEYTLTTLLKELQTYQSLMKCKG

Query:  QEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYLAEKKKANEDKYDLLVLET
        Q+GEANVATS ++F+RGS+SGT+S PSSSG+K +KKKK  G+G+K + AAA    K K A KG CFHCN +GHWKRNCPKYLAEKKKA + KYDLLVLET
Subjt:  QEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYLAEKKKANEDKYDLLVLET

Query:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDAREMTLKVGTGEVFSAVA
        CLVENDDSAWI+DSGATNHVCSSFQGISSWRQL+  EMT++VGTG V SA+A
Subjt:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDAREMTLKVGTGEVFSAVA

A0A5A7UGV2 Gag/pol protein; e value 1.1e-133; identity 70.45%
Query:  MFTSIIALLATLRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHENTVTAKEIMNSLQS
        M ++ + +LA  +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ P  NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE+ +TA+EIM+SLQ 
Subjt:  MFTSIIALLATLRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHENTVTAKEIMNSLQS

Query:  MFGQLSSQARHEALKFIYNSPIKEGSSVRENVLNLMVHFNVAESNGAVIDEQSQVSFILKSLPKSFLPFRSNAVMNKLEYTLTTLLKELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+ + EG+SVRE+VLN+MVHFNVAE NGAVIDE SQVSFIL+SLP+SFL FRSNAVMNK+ YTLTTLL ELQT++SLMK KG
Subjt:  MFGQLSSQARHEALKFIYNSPIKEGSSVRENVLNLMVHFNVAESNGAVIDEQSQVSFILKSLPKSFLPFRSNAVMNKLEYTLTTLLKELQTYQSLMKCKG

Query:  QEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYLAEKKKANEDKYDLLVLET
        Q+GEANVATS ++F+RGS+SGT+S PSSSG+K +KKKK  G+G+K + AAA    K K A KG CFHCN +GHWKRNCPKYLAEKKKA + KYDLLVLET
Subjt:  QEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYLAEKKKANEDKYDLLVLET

Query:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDAREMTLKVGTGEVFSAVA
        CLVENDDSAWI+DSGATNHVCSSFQGISSWRQL+  EMT++VGTG V SA+A
Subjt:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDAREMTLKVGTGEVFSAVA

A0A5D3CPJ6 Gag/pol protein; e value 1.1e-133; identity 70.45%
Query:  MFTSIIALLATLRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHENTVTAKEIMNSLQS
        M ++ + +LA  +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ P  NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE+ +TA+EIM+SLQ 
Subjt:  MFTSIIALLATLRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHENTVTAKEIMNSLQS

Query:  MFGQLSSQARHEALKFIYNSPIKEGSSVRENVLNLMVHFNVAESNGAVIDEQSQVSFILKSLPKSFLPFRSNAVMNKLEYTLTTLLKELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+ + EG+SVRE+VLN+MVHFNVAE NGAVIDE SQVSFIL+SLP+SFL FRSNAVMNK+ YTLTTLL ELQT++SLMK KG
Subjt:  MFGQLSSQARHEALKFIYNSPIKEGSSVRENVLNLMVHFNVAESNGAVIDEQSQVSFILKSLPKSFLPFRSNAVMNKLEYTLTTLLKELQTYQSLMKCKG

Query:  QEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYLAEKKKANEDKYDLLVLET
        Q+GEANVATS ++F+RGS+SGT+S PSSSG+K +KKKK  G+G+K + AAA    K K A KG CFHCN +GHWKRNCPKYLAEKKKA + KYDLLVLET
Subjt:  QEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYLAEKKKANEDKYDLLVLET

Query:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDAREMTLKVGTGEVFSAVA
        CLVENDDSAWI+DSGATNHVCSSFQGISSWRQL+  EMT++VGTG V SA+A
Subjt:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDAREMTLKVGTGEVFSAVA

A0A5D3CSZ6 Gag/pol protein; e value 1.1e-133; identity 70.45%
Query:  MFTSIIALLATLRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHENTVTAKEIMNSLQS
        M ++ + +LA  +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ P  NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE+ +TA+EIM+SLQ 
Subjt:  MFTSIIALLATLRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHENTVTAKEIMNSLQS

Query:  MFGQLSSQARHEALKFIYNSPIKEGSSVRENVLNLMVHFNVAESNGAVIDEQSQVSFILKSLPKSFLPFRSNAVMNKLEYTLTTLLKELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+ + EG+SVRE+VLN+MVHFNVAE NGAVIDE SQVSFIL+SLP+SFL FRSNAVMNK+ YTLTTLL ELQT++SLMK KG
Subjt:  MFGQLSSQARHEALKFIYNSPIKEGSSVRENVLNLMVHFNVAESNGAVIDEQSQVSFILKSLPKSFLPFRSNAVMNKLEYTLTTLLKELQTYQSLMKCKG

Query:  QEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYLAEKKKANEDKYDLLVLET
        Q+GEANVATS ++F+RGS+SGT+S PSSSG+K +KKKK  G+G+K + AAA    K K A KG CFHCN +GHWKRNCPKYLAEKKKA + KYDLLVLET
Subjt:  QEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYLAEKKKANEDKYDLLVLET

Query:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDAREMTLKVGTGEVFSAVA
        CLVENDDSAWI+DSGATNHVCSSFQGISSWRQL+  EMT++VGTG V SA+A
Subjt:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDAREMTLKVGTGEVFSAVA

SwissProt top hits (columns: e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value 6.4e-06; identity 20.85%
Query:  RLNGEN-YKQWKSNLNTILVIDDLRFVLQEDCPQAPVPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHENTVTAKEIMNSLQSMFGQLSSQARH
        + NG+N +  W+  +  +L+   L  VL  D  +     A        + W   +++A   I   +SD +     +  TA+ I   L+S++   +   + 
Subjt:  RLNGEN-YKQWKSNLNTILVIDDLRFVLQEDCPQAPVPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHENTVTAKEIMNSLQSMFGQLSSQARH

Query:  EALKFIYNSPIKEGSSVRENVLNLMVHFNV-------AESNGAVIDEQSQVSFILKSLPKSFLPFRSNAVMNKLEYTLTTLLKELQTYQSLMKCKGQEGE
           K +Y   + EG+       N + H NV         + G  I+E+ +   +L SLP S+    +  +  K    L  +   L   + + K    +G+
Subjt:  EALKFIYNSPIKEGSSVRENVLNLMVHFNV-------AESNGAVIDEQSQVSFILKSLPKSFLPFRSNAVMNKLEYTLTTLLKELQTYQSLMKCKGQEGE

Query:  ANVATSKRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYLAEKKKANEDKYD-----------
        A +              R       S  + +  A GK           K +V+      C++CN  GH+KR+CP     K + +  K D           
Subjt:  ANVATSKRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYLAEKKKANEDKYD-----------

Query:  --LLVL---ETCL-VENDDSAWILDSGATNH
          +L +   E C+ +   +S W++D+ A++H
Subjt:  --LLVL---ETCL-VENDDSAWILDSGATNH

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value 2.2e-06; identity 21.57%
Query:  VGSHLRGVVNMFTSI--IALLATLRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVPNATVAVRNA---YDRWIKANDKAKVYILASISDVLAKKH
        + +H   +V + T+I  + +    +L   NY  W   ++ +    +L   L    P  P    T AV      Y RW + +      IL +IS  +    
Subjt:  VGSHLRGVVNMFTSI--IALLATLRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVPNATVAVRNA---YDRWIKANDKAKVYILASISDVLAKKH

Query:  ENTVTAKEIMNSLQSMFGQLSSQARHEALKFIYNSPIKEGSSVRENVLNLMVHFNVAESNGAVIDEQSQVSFILKSLPKSFLPFRSNAVMNKLEYTLTTL
            TA +I                 E L+ IY +P    S      L  +  F+     G  +D   QV  +L++LP  + P            +LT +
Subjt:  ENTVTAKEIMNSLQSMFGQLSSQARHEALKFIYNSPIKEGSSVRENVLNLMVHFNVAESNGAVIDEQSQVSFILKSLPKSFLPFRSNAVMNKLEYTLTTL

Query:  LKELQTYQS-LMKCKGQEG---EANVATSKRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYL
         + L   +S L+     E     ANV T +  N   +   R       ++ +          +P S+ +    +      G+C  C++ GH  + CP+  
Subjt:  LKELQTYQS-LMKCKGQEG---EANVATSKRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYL

Query:  AEKKKANED-----------KYDLLVLETCLVENDDSAWILDSGATNHVCSSFQGIS
          +   N+            + +L V       N    W+LDSGAT+H+ S F  +S
Subjt:  AEKKKANED-----------KYDLLVLETCLVENDDSAWILDSGATNHVCSSFQGIS

Arabidopsis top hits (columns: e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGACCTGCATGTCGTCCTGGAGTGACCATCCCTACGGAGGGTTCATTGATTATTGGGGTGGACCTCTGAGGTCCGAAAATGTTGGGTCACACTTACGAGGAGTTGTTAA
CATGTTTACTTCTATTATTGCACTCTTAGCCACGCTAAGACTTAATGGCGAAAATTACAAACAATGGAAGTCAAACCTAAACACTATTCTCGTGATAGATGATCTTAGGT
TCGTCTTGCAAGAGGATTGTCCTCAAGCTCCTGTGCCTAACGCCACTGTGGCGGTGCGCAACGCCTATGATCGGTGGATCAAGGCCAATGACAAGGCCAAGGTCTACATC
TTGGCGAGCATATCTGATGTGCTTGCCAAGAAGCACGAGAACACGGTCACCGCTAAGGAGATCATGAACTCGCTGCAGAGCATGTTTGGACAACTGTCCTCACAGGCTCG
ACATGAAGCCCTTAAGTTCATTTACAATTCCCCCATAAAGGAGGGTTCATCAGTGCGAGAAAACGTTCTCAACCTGATGGTCCACTTCAACGTGGCTGAGTCGAACGGGG
CCGTCATAGACGAGCAGAGTCAGGTCAGCTTCATTCTGAAATCTCTTCCGAAGAGTTTCTTGCCATTCCGCAGCAATGCGGTTATGAATAAGCTGGAGTACACTCTTACC
ACGCTCCTAAAGGAGCTGCAGACCTACCAGTCTCTTATGAAATGTAAGGGACAAGAAGGGGAGGCAAATGTTGCCACCTCAAAGAGGTTCAACAGAGGATCGTCCTCTGG
AACCAGGTCTGCGCCCTCTTCTTCTGGAAGTAAGACTTTTAAGAAGAAGAAGGCTGCTGGTAAGGGGTCTAAACCTGACTCAGCTGCTGCTGCCCAGAAAGGCAAGGTCA
AGGTTGCAGAGAAAGGAAAGTGTTTCCACTGCAATATGGACGGGCATTGGAAGCGCAACTGCCCAAAGTACTTGGCCGAAAAGAAGAAAGCCAATGAAGATAAATATGAT
TTACTTGTATTGGAAACATGTTTAGTGGAGAATGATGACTCTGCCTGGATACTGGATTCAGGAGCCACTAATCACGTTTGTTCTTCATTTCAGGGAATTAGTTCCTGGAG
GCAGCTTGACGCCAGAGAGATGACTCTCAAGGTCGGAACGGGAGAGGTCTTCTCAGCTGTGGCAGACTTGAACAAAGGGCCCCACCCTCTCATTGGCCGAGAGGGACTTC
CGGCTATTGGTTGGACCATAACCAGGTTGTTCATTAGAGGAGCAGTGGTGTTTAAGGAGTGCTACCATAAGCACGATCCCGAGACCCGAGAGGATAGCGAGGAAGATCCG
GTGGTGGTGTTCGAAGGGAACTCACTGAAGAAACGTTCTTCAAAGGTTTGGTTTTTCCCCTGTATTCCTTATTTCCAATTCGATTTTGTATCCCGAAAATCAGCAACGAA
TATCCGCTTCCGCTCCGGGATTCACATCCCTTCAAAATCCTCGGACACACTTAAAGCTCTCTTGACTTTCCCAAACAATGTTCAGGATCTCGCTGAGGTATCTTCGTTAG
AAGCGCACAAGCAAGTCCCCCTGACACCTCGTCGAGACTCCTTTGCCCCCTGA
mRNA sequence
ATGACCTGCATGTCGTCCTGGAGTGACCATCCCTACGGAGGGTTCATTGATTATTGGGGTGGACCTCTGAGGTCCGAAAATGTTGGGTCACACTTACGAGGAGTTGTTAA
CATGTTTACTTCTATTATTGCACTCTTAGCCACGCTAAGACTTAATGGCGAAAATTACAAACAATGGAAGTCAAACCTAAACACTATTCTCGTGATAGATGATCTTAGGT
TCGTCTTGCAAGAGGATTGTCCTCAAGCTCCTGTGCCTAACGCCACTGTGGCGGTGCGCAACGCCTATGATCGGTGGATCAAGGCCAATGACAAGGCCAAGGTCTACATC
TTGGCGAGCATATCTGATGTGCTTGCCAAGAAGCACGAGAACACGGTCACCGCTAAGGAGATCATGAACTCGCTGCAGAGCATGTTTGGACAACTGTCCTCACAGGCTCG
ACATGAAGCCCTTAAGTTCATTTACAATTCCCCCATAAAGGAGGGTTCATCAGTGCGAGAAAACGTTCTCAACCTGATGGTCCACTTCAACGTGGCTGAGTCGAACGGGG
CCGTCATAGACGAGCAGAGTCAGGTCAGCTTCATTCTGAAATCTCTTCCGAAGAGTTTCTTGCCATTCCGCAGCAATGCGGTTATGAATAAGCTGGAGTACACTCTTACC
ACGCTCCTAAAGGAGCTGCAGACCTACCAGTCTCTTATGAAATGTAAGGGACAAGAAGGGGAGGCAAATGTTGCCACCTCAAAGAGGTTCAACAGAGGATCGTCCTCTGG
AACCAGGTCTGCGCCCTCTTCTTCTGGAAGTAAGACTTTTAAGAAGAAGAAGGCTGCTGGTAAGGGGTCTAAACCTGACTCAGCTGCTGCTGCCCAGAAAGGCAAGGTCA
AGGTTGCAGAGAAAGGAAAGTGTTTCCACTGCAATATGGACGGGCATTGGAAGCGCAACTGCCCAAAGTACTTGGCCGAAAAGAAGAAAGCCAATGAAGATAAATATGAT
TTACTTGTATTGGAAACATGTTTAGTGGAGAATGATGACTCTGCCTGGATACTGGATTCAGGAGCCACTAATCACGTTTGTTCTTCATTTCAGGGAATTAGTTCCTGGAG
GCAGCTTGACGCCAGAGAGATGACTCTCAAGGTCGGAACGGGAGAGGTCTTCTCAGCTGTGGCAGACTTGAACAAAGGGCCCCACCCTCTCATTGGCCGAGAGGGACTTC
CGGCTATTGGTTGGACCATAACCAGGTTGTTCATTAGAGGAGCAGTGGTGTTTAAGGAGTGCTACCATAAGCACGATCCCGAGACCCGAGAGGATAGCGAGGAAGATCCG
GTGGTGGTGTTCGAAGGGAACTCACTGAAGAAACGTTCTTCAAAGGTTTGGTTTTTCCCCTGTATTCCTTATTTCCAATTCGATTTTGTATCCCGAAAATCAGCAACGAA
TATCCGCTTCCGCTCCGGGATTCACATCCCTTCAAAATCCTCGGACACACTTAAAGCTCTCTTGACTTTCCCAAACAATGTTCAGGATCTCGCTGAGGTATCTTCGTTAG
AAGCGCACAAGCAAGTCCCCCTGACACCTCGTCGAGACTCCTTTGCCCCCTGA
Protein sequence
MTCMSSWSDHPYGGFIDYWGGPLRSENVGSHLRGVVNMFTSIIALLATLRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPVPNATVAVRNAYDRWIKANDKAKVYI
LASISDVLAKKHENTVTAKEIMNSLQSMFGQLSSQARHEALKFIYNSPIKEGSSVRENVLNLMVHFNVAESNGAVIDEQSQVSFILKSLPKSFLPFRSNAVMNKLEYTLT
TLLKELQTYQSLMKCKGQEGEANVATSKRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMDGHWKRNCPKYLAEKKKANEDKYD
LLVLETCLVENDDSAWILDSGATNHVCSSFQGISSWRQLDAREMTLKVGTGEVFSAVADLNKGPHPLIGREGLPAIGWTITRLFIRGAVVFKECYHKHDPETREDSEEDP
VVVFEGNSLKKRSSKVWFFPCIPYFQFDFVSRKSATNIRFRSGIHIPSKSSDTLKALLTFPNNVQDLAEVSSLEAHKQVPLTPRRDSFAP
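
As a quick consistency check between the CDS and protein sequences listed above, the first and last codons of the CDS can be translated and compared with the two ends of the protein. The sketch below does this for two short excerpts only (the first 30 and the final 15 nucleotides of the CDS). The helper name translate and the compact codon-table construction are choices made for this illustration, and the sketch assumes the standard nuclear genetic code applies.

```python
from itertools import product

# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 0; '*' marks a stop codon. Trailing bases are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Excerpts copied from the CDS sequence of Moc02g25580 on this page.
CDS_START = "ATGACCTGCATGTCGTCCTGGAGTGACCAT"  # first 30 nucleotides
CDS_END = "TCCTTTGCCCCCTGA"                   # final 15 nucleotides

print(translate(CDS_START))
print(translate(CDS_END))
```

The two translations can then be compared by eye with the start (MTCMSSWSDH...) and end (...SFAP) of the protein sequence shown above.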