; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g03300 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g03300
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr3:2505077..2518200
RNA-Seq ExpressionMoc03g03300
SyntenyMoc03g03300
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]2.0e-2659.29Show/hide
Query:  VVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRIVSQPDHYVGLTETQVVIPDDGVEDPLTYKKAMEDTDEDKWVKAMDLEM
        +V+NE+S+E T  STRVV++    TRVV   S++R +H PQ LR PRRSGR+ + P  Y+ LTET  VI D  +EDPLT+KKAMED D+D+W+KAM+LE+
Subjt:  VVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRIVSQPDHYVGLTETQVVIPDDGVEDPLTYKKAMEDTDEDKWVKAMDLEM

Query:  ESMYFNSVWELHD
        ESMYFNSVW+L D
Subjt:  ESMYFNSVWELHD

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]2.0e-2659.29Show/hide
Query:  VVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRIVSQPDHYVGLTETQVVIPDDGVEDPLTYKKAMEDTDEDKWVKAMDLEM
        +V+NE+S+E T  STRVV++    TRVV   S++R +H PQ LR PRRSGR+ + P  Y+ LTET  VI D  +EDPLT+KKAMED D+D+W+KAM+LE+
Subjt:  VVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRIVSQPDHYVGLTETQVVIPDDGVEDPLTYKKAMEDTDEDKWVKAMDLEM

Query:  ESMYFNSVWELHD
        ESMYFNSVW+L D
Subjt:  ESMYFNSVWELHD

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]3.7e-13771.1Show/hide
Query:  MYTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQS
        M ++ + +LAA +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE M+TA+EIMDSLQ 
Subjt:  MYTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVTESNGAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNV E NGAVIDE SQVSFILESLP+SFL FRSN VMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVTESNGAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKCKG

Query:  QEGEANVSTS-KRFNRGSSYGTRYAPYSSGSKTFKKKKDAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMNGHWKRNCPKYLPEKKKANEGKYDLLVLET
        Q+GEANV+TS ++F+RGS+ GT+  P SSG+K +KKKK  G+G+K + AAA    K K A KG CFHCN  GHWKRNCPKYL EKKKA +GKYDLLVLET
Subjt:  QEGEANVSTS-KRFNRGSSYGTRYAPYSSGSKTFKKKKDAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMNGHWKRNCPKYLPEKKKANEGKYDLLVLET

Query:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDVGKMTLKVGTGDVISVVAV
        CLVENDDSAWI+DSGATNHVCSSFQGISSWRQL+ G+MT++VGTG V+S +AV
Subjt:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDVGKMTLKVGTGDVISVVAV

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]2.0e-2659.29Show/hide
Query:  VVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRIVSQPDHYVGLTETQVVIPDDGVEDPLTYKKAMEDTDEDKWVKAMDLEM
        +V+NE+S+E T  STRVV++    TRVV   S++R +H PQ LR PRRSGR+ + P  Y+ LTET  VI D  +EDPLT+KKAMED D+D+W+KAM+LE+
Subjt:  VVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRIVSQPDHYVGLTETQVVIPDDGVEDPLTYKKAMEDTDEDKWVKAMDLEM

Query:  ESMYFNSVWELHD
        ESMYFNSVW+L D
Subjt:  ESMYFNSVWELHD

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]3.7e-13771.1Show/hide
Query:  MYTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQS
        M ++ + +LAA +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE M+TA+EIMDSLQ 
Subjt:  MYTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVTESNGAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNV E NGAVIDE SQVSFILESLP+SFL FRSN VMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVTESNGAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKCKG

Query:  QEGEANVSTS-KRFNRGSSYGTRYAPYSSGSKTFKKKKDAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMNGHWKRNCPKYLPEKKKANEGKYDLLVLET
        Q+GEANV+TS ++F+RGS+ GT+  P SSG+K +KKKK  G+G+K + AAA    K K A KG CFHCN  GHWKRNCPKYL EKKKA +GKYDLLVLET
Subjt:  QEGEANVSTS-KRFNRGSSYGTRYAPYSSGSKTFKKKKDAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMNGHWKRNCPKYLPEKKKANEGKYDLLVLET

Query:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDVGKMTLKVGTGDVISVVAV
        CLVENDDSAWI+DSGATNHVCSSFQGISSWRQL+ G+MT++VGTG V+S +AV
Subjt:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDVGKMTLKVGTGDVISVVAV

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]2.0e-2659.29Show/hide
Query:  VVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRIVSQPDHYVGLTETQVVIPDDGVEDPLTYKKAMEDTDEDKWVKAMDLEM
        +V+NE+S+E T  STRVV++    TRVV   S++R +H PQ LR PRRSGR+ + P  Y+ LTET  VI D  +EDPLT+KKAMED D+D+W+KAM+LE+
Subjt:  VVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRIVSQPDHYVGLTETQVVIPDDGVEDPLTYKKAMEDTDEDKWVKAMDLEM

Query:  ESMYFNSVWELHD
        ESMYFNSVW+L D
Subjt:  ESMYFNSVWELHD

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]3.7e-13771.1Show/hide
Query:  MYTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQS
        M ++ + +LAA +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE M+TA+EIMDSLQ 
Subjt:  MYTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVTESNGAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNV E NGAVIDE SQVSFILESLP+SFL FRSN VMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVTESNGAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKCKG

Query:  QEGEANVSTS-KRFNRGSSYGTRYAPYSSGSKTFKKKKDAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMNGHWKRNCPKYLPEKKKANEGKYDLLVLET
        Q+GEANV+TS ++F+RGS+ GT+  P SSG+K +KKKK  G+G+K + AAA    K K A KG CFHCN  GHWKRNCPKYL EKKKA +GKYDLLVLET
Subjt:  QEGEANVSTS-KRFNRGSSYGTRYAPYSSGSKTFKKKKDAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMNGHWKRNCPKYLPEKKKANEGKYDLLVLET

Query:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDVGKMTLKVGTGDVISVVAV
        CLVENDDSAWI+DSGATNHVCSSFQGISSWRQL+ G+MT++VGTG V+S +AV
Subjt:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDVGKMTLKVGTGDVISVVAV

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]3.7e-13771.1Show/hide
Query:  MYTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQS
        M ++ + +LAA +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE M+TA+EIMDSLQ 
Subjt:  MYTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVTESNGAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNV E NGAVIDE SQVSFILESLP+SFL FRSN VMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVTESNGAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKCKG

Query:  QEGEANVSTS-KRFNRGSSYGTRYAPYSSGSKTFKKKKDAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMNGHWKRNCPKYLPEKKKANEGKYDLLVLET
        Q+GEANV+TS ++F+RGS+ GT+  P SSG+K +KKKK  G+G+K + AAA    K K A KG CFHCN  GHWKRNCPKYL EKKKA +GKYDLLVLET
Subjt:  QEGEANVSTS-KRFNRGSSYGTRYAPYSSGSKTFKKKKDAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMNGHWKRNCPKYLPEKKKANEGKYDLLVLET

Query:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDVGKMTLKVGTGDVISVVAV
        CLVENDDSAWI+DSGATNHVCSSFQGISSWRQL+ G+MT++VGTG V+S +AV
Subjt:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDVGKMTLKVGTGDVISVVAV

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]2.0e-2659.29Show/hide
Query:  VVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRIVSQPDHYVGLTETQVVIPDDGVEDPLTYKKAMEDTDEDKWVKAMDLEM
        +V+NE+S+E T  STRVV++    TRVV   S++R +H PQ LR PRRSGR+ + P  Y+ LTET  VI D  +EDPLT+KKAMED D+D+W+KAM+LE+
Subjt:  VVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRIVSQPDHYVGLTETQVVIPDDGVEDPLTYKKAMEDTDEDKWVKAMDLEM

Query:  ESMYFNSVWELHD
        ESMYFNSVW+L D
Subjt:  ESMYFNSVWELHD

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]3.7e-13771.1Show/hide
Query:  MYTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQS
        M ++ + +LAA +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE M+TA+EIMDSLQ 
Subjt:  MYTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVTESNGAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNV E NGAVIDE SQVSFILESLP+SFL FRSN VMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVTESNGAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKCKG

Query:  QEGEANVSTS-KRFNRGSSYGTRYAPYSSGSKTFKKKKDAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMNGHWKRNCPKYLPEKKKANEGKYDLLVLET
        Q+GEANV+TS ++F+RGS+ GT+  P SSG+K +KKKK  G+G+K + AAA    K K A KG CFHCN  GHWKRNCPKYL EKKKA +GKYDLLVLET
Subjt:  QEGEANVSTS-KRFNRGSSYGTRYAPYSSGSKTFKKKKDAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMNGHWKRNCPKYLPEKKKANEGKYDLLVLET

Query:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDVGKMTLKVGTGDVISVVAV
        CLVENDDSAWI+DSGATNHVCSSFQGISSWRQL+ G+MT++VGTG V+S +AV
Subjt:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDVGKMTLKVGTGDVISVVAV

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein1.8e-13771.1Show/hide
Query:  MYTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQS
        M ++ + +LAA +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE M+TA+EIMDSLQ 
Subjt:  MYTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVTESNGAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNV E NGAVIDE SQVSFILESLP+SFL FRSN VMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVTESNGAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKCKG

Query:  QEGEANVSTS-KRFNRGSSYGTRYAPYSSGSKTFKKKKDAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMNGHWKRNCPKYLPEKKKANEGKYDLLVLET
        Q+GEANV+TS ++F+RGS+ GT+  P SSG+K +KKKK  G+G+K + AAA    K K A KG CFHCN  GHWKRNCPKYL EKKKA +GKYDLLVLET
Subjt:  QEGEANVSTS-KRFNRGSSYGTRYAPYSSGSKTFKKKKDAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMNGHWKRNCPKYLPEKKKANEGKYDLLVLET

Query:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDVGKMTLKVGTGDVISVVAV
        CLVENDDSAWI+DSGATNHVCSSFQGISSWRQL+ G+MT++VGTG V+S +AV
Subjt:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDVGKMTLKVGTGDVISVVAV

A0A5A7SMH8 Gag/pol protein9.7e-2759.29Show/hide
Query:  VVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRIVSQPDHYVGLTETQVVIPDDGVEDPLTYKKAMEDTDEDKWVKAMDLEM
        +V+NE+S+E T  STRVV++    TRVV   S++R +H PQ LR PRRSGR+ + P  Y+ LTET  VI D  +EDPLT+KKAMED D+D+W+KAM+LE+
Subjt:  VVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRIVSQPDHYVGLTETQVVIPDDGVEDPLTYKKAMEDTDEDKWVKAMDLEM

Query:  ESMYFNSVWELHD
        ESMYFNSVW+L D
Subjt:  ESMYFNSVWELHD

A0A5A7SMH8 Gag/pol protein1.8e-13771.1Show/hide
Query:  MYTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQS
        M ++ + +LAA +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE M+TA+EIMDSLQ 
Subjt:  MYTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVTESNGAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNV E NGAVIDE SQVSFILESLP+SFL FRSN VMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVTESNGAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKCKG

Query:  QEGEANVSTS-KRFNRGSSYGTRYAPYSSGSKTFKKKKDAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMNGHWKRNCPKYLPEKKKANEGKYDLLVLET
        Q+GEANV+TS ++F+RGS+ GT+  P SSG+K +KKKK  G+G+K + AAA    K K A KG CFHCN  GHWKRNCPKYL EKKKA +GKYDLLVLET
Subjt:  QEGEANVSTS-KRFNRGSSYGTRYAPYSSGSKTFKKKKDAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMNGHWKRNCPKYLPEKKKANEGKYDLLVLET

Query:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDVGKMTLKVGTGDVISVVAV
        CLVENDDSAWI+DSGATNHVCSSFQGISSWRQL+ G+MT++VGTG V+S +AV
Subjt:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDVGKMTLKVGTGDVISVVAV

A0A5A7TWB9 Gag/pol protein9.7e-2759.29Show/hide
Query:  VVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRIVSQPDHYVGLTETQVVIPDDGVEDPLTYKKAMEDTDEDKWVKAMDLEM
        +V+NE+S+E T  STRVV++    TRVV   S++R +H PQ LR PRRSGR+ + P  Y+ LTET  VI D  +EDPLT+KKAMED D+D+W+KAM+LE+
Subjt:  VVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRIVSQPDHYVGLTETQVVIPDDGVEDPLTYKKAMEDTDEDKWVKAMDLEM

Query:  ESMYFNSVWELHD
        ESMYFNSVW+L D
Subjt:  ESMYFNSVWELHD

A0A5A7TWB9 Gag/pol protein1.8e-13771.1Show/hide
Query:  MYTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQS
        M ++ + +LAA +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE M+TA+EIMDSLQ 
Subjt:  MYTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVTESNGAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNV E NGAVIDE SQVSFILESLP+SFL FRSN VMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVTESNGAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKCKG

Query:  QEGEANVSTS-KRFNRGSSYGTRYAPYSSGSKTFKKKKDAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMNGHWKRNCPKYLPEKKKANEGKYDLLVLET
        Q+GEANV+TS ++F+RGS+ GT+  P SSG+K +KKKK  G+G+K + AAA    K K A KG CFHCN  GHWKRNCPKYL EKKKA +GKYDLLVLET
Subjt:  QEGEANVSTS-KRFNRGSSYGTRYAPYSSGSKTFKKKKDAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMNGHWKRNCPKYLPEKKKANEGKYDLLVLET

Query:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDVGKMTLKVGTGDVISVVAV
        CLVENDDSAWI+DSGATNHVCSSFQGISSWRQL+ G+MT++VGTG V+S +AV
Subjt:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDVGKMTLKVGTGDVISVVAV

A0A5A7UGV2 Gag/pol protein9.7e-2759.29Show/hide
Query:  VVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRIVSQPDHYVGLTETQVVIPDDGVEDPLTYKKAMEDTDEDKWVKAMDLEM
        +V+NE+S+E T  STRVV++    TRVV   S++R +H PQ LR PRRSGR+ + P  Y+ LTET  VI D  +EDPLT+KKAMED D+D+W+KAM+LE+
Subjt:  VVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRIVSQPDHYVGLTETQVVIPDDGVEDPLTYKKAMEDTDEDKWVKAMDLEM

Query:  ESMYFNSVWELHD
        ESMYFNSVW+L D
Subjt:  ESMYFNSVWELHD

A0A5D3CPJ6 Gag/pol protein9.7e-2759.29Show/hide
Query:  VVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRIVSQPDHYVGLTETQVVIPDDGVEDPLTYKKAMEDTDEDKWVKAMDLEM
        +V+NE+S+E T  STRVV++    TRVV   S++R +H PQ LR PRRSGR+ + P  Y+ LTET  VI D  +EDPLT+KKAMED D+D+W+KAM+LE+
Subjt:  VVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRIVSQPDHYVGLTETQVVIPDDGVEDPLTYKKAMEDTDEDKWVKAMDLEM

Query:  ESMYFNSVWELHD
        ESMYFNSVW+L D
Subjt:  ESMYFNSVWELHD

A0A5D3CPJ6 Gag/pol protein1.8e-13771.1Show/hide
Query:  MYTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQS
        M ++ + +LAA +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE M+TA+EIMDSLQ 
Subjt:  MYTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVTESNGAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNV E NGAVIDE SQVSFILESLP+SFL FRSN VMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVTESNGAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKCKG

Query:  QEGEANVSTS-KRFNRGSSYGTRYAPYSSGSKTFKKKKDAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMNGHWKRNCPKYLPEKKKANEGKYDLLVLET
        Q+GEANV+TS ++F+RGS+ GT+  P SSG+K +KKKK  G+G+K + AAA    K K A KG CFHCN  GHWKRNCPKYL EKKKA +GKYDLLVLET
Subjt:  QEGEANVSTS-KRFNRGSSYGTRYAPYSSGSKTFKKKKDAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMNGHWKRNCPKYLPEKKKANEGKYDLLVLET

Query:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDVGKMTLKVGTGDVISVVAV
        CLVENDDSAWI+DSGATNHVCSSFQGISSWRQL+ G+MT++VGTG V+S +AV
Subjt:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDVGKMTLKVGTGDVISVVAV

A0A5D3CSZ6 Gag/pol protein9.7e-2759.29Show/hide
Query:  VVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRIVSQPDHYVGLTETQVVIPDDGVEDPLTYKKAMEDTDEDKWVKAMDLEM
        +V+NE+S+E T  STRVV++    TRVV   S++R +H PQ LR PRRSGR+ + P  Y+ LTET  VI D  +EDPLT+KKAMED D+D+W+KAM+LE+
Subjt:  VVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRIVSQPDHYVGLTETQVVIPDDGVEDPLTYKKAMEDTDEDKWVKAMDLEM

Query:  ESMYFNSVWELHD
        ESMYFNSVW+L D
Subjt:  ESMYFNSVWELHD

A0A5D3CSZ6 Gag/pol protein1.8e-13771.1Show/hide
Query:  MYTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQS
        M ++ + +LAA +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE M+TA+EIMDSLQ 
Subjt:  MYTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVTESNGAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNV E NGAVIDE SQVSFILESLP+SFL FRSN VMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVTESNGAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKCKG

Query:  QEGEANVSTS-KRFNRGSSYGTRYAPYSSGSKTFKKKKDAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMNGHWKRNCPKYLPEKKKANEGKYDLLVLET
        Q+GEANV+TS ++F+RGS+ GT+  P SSG+K +KKKK  G+G+K + AAA    K K A KG CFHCN  GHWKRNCPKYL EKKKA +GKYDLLVLET
Subjt:  QEGEANVSTS-KRFNRGSSYGTRYAPYSSGSKTFKKKKDAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMNGHWKRNCPKYLPEKKKANEGKYDLLVLET

Query:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDVGKMTLKVGTGDVISVVAV
        CLVENDDSAWI+DSGATNHVCSSFQGISSWRQL+ G+MT++VGTG V+S +AV
Subjt:  CLVENDDSAWILDSGATNHVCSSFQGISSWRQLDVGKMTLKVGTGDVISVVAV

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.2e-0922.32Show/hide
Query:  RLNGEN-YKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQSMFGQPSSQARH
        + NG+N +  W+  +  +L+   L  VL  D  +     A        + W   +++A   I   +SD +     D  TA+ I   L+S++   +   + 
Subjt:  RLNGEN-YKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQSMFGQPSSQARH

Query:  EALKFIYNSRMKEGSSVREHVLNLMVHFNVTESN-GAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVSTS
           K +Y   M EG++   H LN+        +N G  I+E+ +   +L SLP S+    + ++  K    L  + + L   + + K    +G+A ++  
Subjt:  EALKFIYNSRMKEGSSVREHVLNLMVHFNVTESN-GAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVSTS

Query:  KRFNRGSSYGTRYAPYSSGSKTFKKKKDAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMNGHWKRNCPKYLPEKKKANEG-------------KYDLLVL
            RG SY              +   + G+      +    K +V+      C++CN  GH+KR+CP   P K K                   D +VL
Subjt:  KRFNRGSSYGTRYAPYSSGSKTFKKKKDAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMNGHWKRNCPKYLPEKKKANEG-------------KYDLLVL

Query:  -----ETCL-VENDDSAWILDSGATNH
             E C+ +   +S W++D+ A++H
Subjt:  -----ETCL-VENDDSAWILDSGATNH

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.1e-0621.3Show/hide
Query:  RLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVA---VRNAYDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQSMFGQPSSQA
        +L   NY  W   ++ +    +L   L       PA   T A   V   Y RW + +      +L +IS  +        TA +I ++L+ ++  P S  
Subjt:  RLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVA---VRNAYDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQSMFGQPSSQA

Query:  RHEALKFIYNSRMKEGSSVREHVLNLMVHFNVTESNGAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVST
            L+       K   ++ +++  L+  F+     G  +D   QV  +LE+LP+ + P    +       TLT +   L  ++S +          ++ 
Subjt:  RHEALKFIYNSRMKEGSSVREHVLNLMVHFNVTESNGAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVST

Query:  SKRFNRGSSYGTRYAPYSSGSKTFK-KKKDAGKGSKPDSAAAAQKGKVKVAEK---GKCFHCNMNGHWKRNCPK----------------YLPEKKKANE
        +   +R +   T     ++G++  +   ++    SKP   ++          K   GKC  C + GH  + C +                + P + +AN 
Subjt:  SKRFNRGSSYGTRYAPYSSGSKTFK-KKKDAGKGSKPDSAAAAQKGKVKVAEK---GKCFHCNMNGHWKRNCPK----------------YLPEKKKANE

Query:  GKYDLLVLETCLVENDDSAWILDSGATNHVCSSFQGIS
             L L +    N+   W+LDSGAT+H+ S F  +S
Subjt:  GKYDLLVLETCLVENDDSAWILDSGATNHVCSSFQGIS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.7e-0821.81Show/hide
Query:  RLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNA---YDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQSMFGQPSSQA
        +L   NY  W   ++ +    +L   L    P  PA   T AV      Y RW + +      IL +IS  +        TA +I ++L+ ++  P S  
Subjt:  RLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNA---YDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQSMFGQPSSQA

Query:  RHEALKFIYNSRMKEGSSVREHVLNLMVHFNVTESNGAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLT----TLLNELQTYQSLMKCKGQEGEA
            L+FI                     F+     G  +D   QV  +LE+LP  + P    +       +LT     L+N      +L   +     A
Subjt:  RHEALKFIYNSRMKEGSSVREHVLNLMVHFNVTESNGAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLT----TLLNELQTYQSLMKCKGQEGEA

Query:  NVSTSKRFNRGSSYGTRYAPYSSGSKTFKKKKDAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMNGHWKRNCPKYLPEKKKAN-----------EGKYDL
        NV T +  N   +   R       ++ +    +     +P S+ +    +      G+C  C++ GH  + CP+    +   N           + + +L
Subjt:  NVSTSKRFNRGSSYGTRYAPYSSGSKTFKKKKDAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMNGHWKRNCPKYLPEKKKAN-----------EGKYDL

Query:  LVLETCLVENDDSAWILDSGATNHVCSSFQGISSWRQLDVGKMTLKVGTGDVISVVAVVINEISEEATNTSTRVVD
         V       N    W+LDSGAT+H+ S F  + S+ Q   G   + +  G  I      I      +  TS+R +D
Subjt:  LVLETCLVENDDSAWILDSGATNHVCSSFQGISSWRQLDVGKMTLKVGTGDVISVVAVVINEISEEATNTSTRVVD

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATACTTCTATTATTGCACTTTTAGCCGCGCAAAGACTTAATGGCGAAAATTACAAACAATGGAAGTCAAACCTAAACACTATTCTCGTGATAGATGATCTTAGGTT
TGTCTTGCAAGAGGATTGTCCTCAAGCTCCTGCGCCTAACGCCACTGTGGCGGTGCGCAACGCCTATGACAGGTGGATCAAGGCCAATGACAAGGCCAAGGTCTACATCT
TGGCGAGCATATCTGATGTGCTTGCCAAGAAGCACGAGGACATGGTCACCGCTAAGGAGATCATGGACTCGCTGCAGAGCATGTTTGGACAACCGTCCTCACAGGCTCGA
CATGAAGCCCTTAAGTTCATTTACAACTCCCGCATGAAGGAGGGCTCCTCAGTGCGAGAACACGTTCTCAACCTGATGGTCCACTTCAACGTGACTGAGTCGAACGGGGC
CGTCATAGACGAGCAGAGTCAGGTCAGCTTTATTCTGGAATCTCTTCCGAAGAGTTTCCTTCCATTCCGCAGCAATGTGGTTATGAATAAGCTGGAGTACACTCTTACCA
CGCTCTTAAACGAGCTGCAGACCTACCAGTCTCTTATGAAATGTAAGGGACAAGAAGGGGAGGCAAATGTTTCCACCTCAAAGAGGTTCAACAGAGGATCGTCCTATGGA
ACCAGGTATGCGCCCTATTCTTCTGGAAGTAAGACTTTTAAGAAGAAGAAGGATGCTGGTAAGGGGTCTAAACCTGACTCAGCTGCTGCTGCCCAGAAAGGCAAGGTCAA
GGTTGCAGAGAAAGGAAAGTGTTTCCACTGCAACATGAACGGGCATTGGAAGCGCAACTGCCCAAAGTACTTGCCCGAAAAGAAGAAAGCCAACGAAGGTAAATATGATT
TACTTGTATTGGAAACATGTTTAGTGGAGAATGATGACTCCGCCTGGATACTGGATTCAGGAGCCACTAATCACGTTTGTTCTTCATTTCAGGGAATTAGTTCCTGGAGG
CAGCTTGACGTCGGAAAGATGACTCTCAAGGTCGGAACGGGAGATGTCATCTCAGTTGTAGCGGTAGTAATTAACGAGATTTCCGAAGAGGCTACAAACACGTCAACAAG
AGTTGTTGATCAAACTGGCACTACAACAAGAGTTGTTGATGAAGCCAGCACATCGCGTCAGTCACATCCACCTCAAGTGTTGAGGGTGCCTCGACGTAGTGGGAGGATTG
TGTCACAACCTGACCACTACGTGGGTTTAACTGAAACTCAAGTTGTCATACCTGATGACGGCGTCGAGGATCCATTGACCTACAAAAAGGCAATGGAAGATACTGACGAG
GACAAATGGGTGAAAGCAATGGACTTGGAAATGGAGTCGATGTACTTCAATTCCGTTTGGGAACTTCACGATCCCGAGACCCAAGAGGATAGCGAGGAAGACATAGTAGT
GGTGCTCGAGAGAAACTCGTTGAAGAAACGTTCTTCAAAGGCCTCTAAGTTGACAACTACCAAAGAGGAAGTTAATAAGATTTGTAGGATTTCTGAAAGAGGATCCCTCT
CATTTGGATGTGCGTACTATCCAAGATCCTACAAAGATTCGAAACGAAGTGAATTCCTAAAATTAGTGCAAGGGGTGATGACCGTGGCAGAATATCATAAAAAATATATA
AAACTCTCCAGGTATGCATCTACTGTTATCGATGATGAGATTGACAGGTGTAGAAAGTTCGAAGATGGGTTGCGAGAGGAGATTGGGAGTCGCATTACTGCATCCGGATG
GCAAGAGTTTGGGCCTTTAGTAGAAGCAGCCGCCAGGGTTGAAAAAAGTGTGTTGGAAGGAAAGAAACAAAGAGAAGTATCGGAAAATGTGCAAGGTTCTTTTTCTAGCA
CTATGAGTTCTTCAGATAATAGAAAGTTTCGAAGCAACGATAGAGGTTTTGTACCAGGGACTGCGAGTGGTTCAGGTCAATTTAGAAGACCTAGTAAAAGTCAGCACACC
TTGCAGGAACTGGAGTCTGCATTTAGTTACAACATATCCTATTCTTCATGTCAGAAGTGTGGAAAGTTACATAATGGCCAGTGTTTGTGGGGAACAAATATTTGCTATAA
TTGTGGTCAGGTAGGTCACATGAGCAGAGAATATCCACAGCAAGTACATGAAGATAGTGTTTCGCAAGGTGCTTCCTCAAGACCGGTTGTTCAAGGATGTCCTGATACAC
AGATAAAACCATTCACAATGGGTAGCAAAGAACAAATCAATACAAGGAGTTCTAGCGAAGCAAAATCGAAAGGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTATACTTCTATTATTGCACTTTTAGCCGCGCAAAGACTTAATGGCGAAAATTACAAACAATGGAAGTCAAACCTAAACACTATTCTCGTGATAGATGATCTTAGGTT
TGTCTTGCAAGAGGATTGTCCTCAAGCTCCTGCGCCTAACGCCACTGTGGCGGTGCGCAACGCCTATGACAGGTGGATCAAGGCCAATGACAAGGCCAAGGTCTACATCT
TGGCGAGCATATCTGATGTGCTTGCCAAGAAGCACGAGGACATGGTCACCGCTAAGGAGATCATGGACTCGCTGCAGAGCATGTTTGGACAACCGTCCTCACAGGCTCGA
CATGAAGCCCTTAAGTTCATTTACAACTCCCGCATGAAGGAGGGCTCCTCAGTGCGAGAACACGTTCTCAACCTGATGGTCCACTTCAACGTGACTGAGTCGAACGGGGC
CGTCATAGACGAGCAGAGTCAGGTCAGCTTTATTCTGGAATCTCTTCCGAAGAGTTTCCTTCCATTCCGCAGCAATGTGGTTATGAATAAGCTGGAGTACACTCTTACCA
CGCTCTTAAACGAGCTGCAGACCTACCAGTCTCTTATGAAATGTAAGGGACAAGAAGGGGAGGCAAATGTTTCCACCTCAAAGAGGTTCAACAGAGGATCGTCCTATGGA
ACCAGGTATGCGCCCTATTCTTCTGGAAGTAAGACTTTTAAGAAGAAGAAGGATGCTGGTAAGGGGTCTAAACCTGACTCAGCTGCTGCTGCCCAGAAAGGCAAGGTCAA
GGTTGCAGAGAAAGGAAAGTGTTTCCACTGCAACATGAACGGGCATTGGAAGCGCAACTGCCCAAAGTACTTGCCCGAAAAGAAGAAAGCCAACGAAGGTAAATATGATT
TACTTGTATTGGAAACATGTTTAGTGGAGAATGATGACTCCGCCTGGATACTGGATTCAGGAGCCACTAATCACGTTTGTTCTTCATTTCAGGGAATTAGTTCCTGGAGG
CAGCTTGACGTCGGAAAGATGACTCTCAAGGTCGGAACGGGAGATGTCATCTCAGTTGTAGCGGTAGTAATTAACGAGATTTCCGAAGAGGCTACAAACACGTCAACAAG
AGTTGTTGATCAAACTGGCACTACAACAAGAGTTGTTGATGAAGCCAGCACATCGCGTCAGTCACATCCACCTCAAGTGTTGAGGGTGCCTCGACGTAGTGGGAGGATTG
TGTCACAACCTGACCACTACGTGGGTTTAACTGAAACTCAAGTTGTCATACCTGATGACGGCGTCGAGGATCCATTGACCTACAAAAAGGCAATGGAAGATACTGACGAG
GACAAATGGGTGAAAGCAATGGACTTGGAAATGGAGTCGATGTACTTCAATTCCGTTTGGGAACTTCACGATCCCGAGACCCAAGAGGATAGCGAGGAAGACATAGTAGT
GGTGCTCGAGAGAAACTCGTTGAAGAAACGTTCTTCAAAGGCCTCTAAGTTGACAACTACCAAAGAGGAAGTTAATAAGATTTGTAGGATTTCTGAAAGAGGATCCCTCT
CATTTGGATGTGCGTACTATCCAAGATCCTACAAAGATTCGAAACGAAGTGAATTCCTAAAATTAGTGCAAGGGGTGATGACCGTGGCAGAATATCATAAAAAATATATA
AAACTCTCCAGGTATGCATCTACTGTTATCGATGATGAGATTGACAGGTGTAGAAAGTTCGAAGATGGGTTGCGAGAGGAGATTGGGAGTCGCATTACTGCATCCGGATG
GCAAGAGTTTGGGCCTTTAGTAGAAGCAGCCGCCAGGGTTGAAAAAAGTGTGTTGGAAGGAAAGAAACAAAGAGAAGTATCGGAAAATGTGCAAGGTTCTTTTTCTAGCA
CTATGAGTTCTTCAGATAATAGAAAGTTTCGAAGCAACGATAGAGGTTTTGTACCAGGGACTGCGAGTGGTTCAGGTCAATTTAGAAGACCTAGTAAAAGTCAGCACACC
TTGCAGGAACTGGAGTCTGCATTTAGTTACAACATATCCTATTCTTCATGTCAGAAGTGTGGAAAGTTACATAATGGCCAGTGTTTGTGGGGAACAAATATTTGCTATAA
TTGTGGTCAGGTAGGTCACATGAGCAGAGAATATCCACAGCAAGTACATGAAGATAGTGTTTCGCAAGGTGCTTCCTCAAGACCGGTTGTTCAAGGATGTCCTGATACAC
AGATAAAACCATTCACAATGGGTAGCAAAGAACAAATCAATACAAGGAGTTCTAGCGAAGCAAAATCGAAAGGGTGA
Protein sequenceShow/hide protein sequence
MYTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDMVTAKEIMDSLQSMFGQPSSQAR
HEALKFIYNSRMKEGSSVREHVLNLMVHFNVTESNGAVIDEQSQVSFILESLPKSFLPFRSNVVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVSTSKRFNRGSSYG
TRYAPYSSGSKTFKKKKDAGKGSKPDSAAAAQKGKVKVAEKGKCFHCNMNGHWKRNCPKYLPEKKKANEGKYDLLVLETCLVENDDSAWILDSGATNHVCSSFQGISSWR
QLDVGKMTLKVGTGDVISVVAVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRIVSQPDHYVGLTETQVVIPDDGVEDPLTYKKAMEDTDE
DKWVKAMDLEMESMYFNSVWELHDPETQEDSEEDIVVVLERNSLKKRSSKASKLTTTKEEVNKICRISERGSLSFGCAYYPRSYKDSKRSEFLKLVQGVMTVAEYHKKYI
KLSRYASTVIDDEIDRCRKFEDGLREEIGSRITASGWQEFGPLVEAAARVEKSVLEGKKQREVSENVQGSFSSTMSSSDNRKFRSNDRGFVPGTASGSGQFRRPSKSQHT
LQELESAFSYNISYSSCQKCGKLHNGQCLWGTNICYNCGQVGHMSREYPQQVHEDSVSQGASSRPVVQGCPDTQIKPFTMGSKEQINTRSSSEAKSKG