; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh04G019710 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh04G019710
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionGag/pol protein
Genome locationCma_Chr04:11285502..11288552
RNA-Seq ExpressionCmaCh04G019710
SyntenyCmaCh04G019710
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]9.9e-15846.01Show/hide
Query:  RFVLTEECPPNPSSNANRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKEGMFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLD
        RFVLTEECP  P+ NANRTVR+AYDRW+KANDKARVYILAS++DVLAKKHD + TAK IM+SL+E MFGQPS+SLRHEAIK+IY  RMKEGTSVREHVLD
Subjt:  RFVLTEECPPNPSSNANRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKEGMFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLD

Query:  MMVHFNVAEENEVVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTSLLNELQTYQSLLTNKGQTGEAIVAISK-KLLRGSSSKNKSGSSTSKSVLM
        MM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TN  +NKIE+NLT+LLNELQ +Q+L  +KG+  EA VA++K K +RGSSSKNK G S ++   M
Subjt:  MMVHFNVAEENEVVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTSLLNELQTYQSLLTNKGQTGEAIVAISK-KLLRGSSSKNKSGSSTSKSVLM

Query:  KKKGKVKNKIPTNRNKVQK-TDKGKCFHCNENGHWKRIYPKYLDEKKVEK--------------------------------------------------
        KKKGK K     N +KV+K  DKGKCFHCN++GHWKR  PKYL EKK EK                                                  
Subjt:  KKKGKVKNKIPTNRNKVQK-TDKGKCFHCNENGHWKRIYPKYLDEKKVEK--------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------TQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFID
                                                                      T +G RAK  LEL+ +DL G MNVKARGGYEYFISFID
Subjt:  --------------------------------------------------------------TQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFID

Query:  DYSRYGYLYLMRHKFEALEKFREYKTEAEN----------------------------------LLAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDP
        D+SRYG++YL+ HK E+ EKF+EYK E EN                                  L AP  PQQNGVSERRNRTLLDMVRSMMS+AQLPD 
Subjt:  DYSRYGYLYLMRHKFEALEKFREYKTEAEN----------------------------------LLAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDP

Query:  FWGYAVETTTYILNMVPIKSVSETPYELWKGRK---------------------------------GYPKEAKCGLFYDPQENRVF-DTNATFLEEDHVR
        FWGYA+ET  +ILN VP KSV ETPYELWKGRK                                 GYPKE++ GLFY PQEN+VF  TNATFLEEDH R
Subjt:  FWGYAVETTTYILNMVPIKSVSETPYELWKGRK---------------------------------GYPKEAKCGLFYDPQENRVF-DTNATFLEEDHVR

Query:  NHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRYLSLAETQVIIPDDGIEDPLTYKQ
        NHQPRSK+VL E+ K ATDK +        ST+VVD A+ S QSH SQELR+PRRSGRV+ QP+RYL L ETQ+IIPDDG+EDPLTYKQ
Subjt:  NHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRYLSLAETQVIIPDDGIEDPLTYKQ

KAA0026233.1 gag/pol protein [Cucumis melo var. makuwa]1.6e-16055.69Show/hide
Query:  RFVLTEECPPNPSSNANRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKEGMFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLD
        RFVL E+CP  P++NA RTVR+ Y+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+E MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+
Subjt:  RFVLTEECPPNPSSNANRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKEGMFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLD

Query:  MMVHFNVAEENEVVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTSLLNELQTYQSLLTNKGQTGEAIVAIS-KKLLRGSSSKNKSGSSTSKSVLM
        MMVHFNVA  NE VIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT+LLNELQT++SL+  KGQ GEA VA S +K  RGS+S  KS  S+S +   
Subjt:  MMVHFNVAEENEVVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTSLLNELQTYQSLLTNKGQTGEAIVAIS-KKLLRGSSSKNKSGSSTSKSVLM

Query:  KKK----GKVKNKIPTNRNKVQKTDKGKCFHCNENGHWKRIYPKYLDEKKVEK---------TQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFID
        KKK    G   N       K  K  KG CF CN+ GHWKR  PKYL +KK  K         T +G+RAK+ LEL+ +DL G MNVKARGG+EYFI+F D
Subjt:  KKK----GKVKNKIPTNRNKVQKTDKGKCFHCNENGHWKRIYPKYLDEKKVEK---------TQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFID

Query:  DYSRYGYLYLMRHKFEALEKFREYKTEAENLL----------------------------------APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDP
        DYSRYGY+YLM+HK EALEKF+EYK E EN L                                  AP  PQQNGVSERRNRTLLDMVRSMMS+A LP+ 
Subjt:  DYSRYGYLYLMRHKFEALEKFREYKTEAENLL----------------------------------APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDP

Query:  FWGYAVETTTYILNMVPIKSVSETPYELWKGRK---------------------------------GYPKEAKCGLFYDPQENRVF-DTNATFLEEDHVR
        FWGYAV+T  YILN VP KSVSETP +LW G K                                 GYPK  + G FYDP++N+VF  TNATFLEEDH+R
Subjt:  FWGYAVETTTYILNMVPIKSVSETPYELWKGRK---------------------------------GYPKEAKCGLFYDPQENRVF-DTNATFLEEDHVR

Query:  NHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRYLSLAETQVIIPDDGIEDPLTYKQ
         H+PRSK+VL+E+SKE T+ +TRVV++     RVV    +S ++H  Q LR PRRSGRV   P RY+SL ET  +I D  IEDPLT+K+
Subjt:  NHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRYLSLAETQVIIPDDGIEDPLTYKQ

KAA0032020.1 gag/pol protein [Cucumis melo var. makuwa]7.3e-16962.45Show/hide
Query:  RFVLTEECPPNPSSNANRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKEGMFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLD
        RFVL EECP   + NA RTVR+ YD W KAN+KAR YILAS+S VLAKKH++M TA EIM+SL+E MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+
Subjt:  RFVLTEECPPNPSSNANRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKEGMFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLD

Query:  MMVHFNVAEENEVVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTSLLNELQTYQSLLTNKGQTGEAIVAIS-KKLLRGSSSKNKSGSSTSKSVLM
        M+VHFNVAE N  VIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT+LLNELQT++SL+  KGQ GEA VA S +K  RGS+S  KS  S+S S   
Subjt:  MMVHFNVAEENEVVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTSLLNELQTYQSLLTNKGQTGEAIVAIS-KKLLRGSSSKNKSGSSTSKSVLM

Query:  KKK----GKVKNKIPTNRNKVQKTDKGKCFHCNENGHWKRIYPKYLDEKKVEK---------TQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFID
        KKK    G   N       K  K  KG CFHCN+ GHWKR  PKYL E K  K         T +G+RAK+ LEL+ +DL GLMNVKARGG+EYFI+F D
Subjt:  KKK----GKVKNKIPTNRNKVQKTDKGKCFHCNENGHWKRIYPKYLDEKKVEK---------TQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFID

Query:  DYSRYGYLYLMRHKFEALEKFREYKTEAENLLAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETTTYILNMVPIKSVSETPYELWKGRKG
        DYSRYGY+YLM+HK +ALEKF+EYK E EN L PG PQQNGVSERRNRTLLDMV SMMS+A LP+ FWGYAV+T  YILN VP KSVSETP +LW GRKG
Subjt:  DYSRYGYLYLMRHKFEALEKFREYKTEAENLLAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETTTYILNMVPIKSVSETPYELWKGRKG

Query:  YPKEAKCGLFYDPQENRVF-DTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRYL
        YPK  +   FYDP++N+VF  TNATFLE+DH+R H+P +K+VL+E+SKE T+ +TRVV++    TRVV    +S ++H  Q LR PRRSGRV   P RY+
Subjt:  YPKEAKCGLFYDPQENRVF-DTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRYL

Query:  SLAETQVIIPDDGIEDPLTYKQ
        SL ET  +I D  IEDPLT+K+
Subjt:  SLAETQVIIPDDGIEDPLTYKQ

KAA0037371.1 gag/pol protein [Cucumis melo var. makuwa]2.8e-15255.5Show/hide
Query:  RFVLTEECPPNPSSNANRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKEGMFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLD
        RFVL EECP   ++NA RTVR+AY+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+E MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+
Subjt:  RFVLTEECPPNPSSNANRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKEGMFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLD

Query:  MMVHFNVAEENEVVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTSLLNELQTYQSLLTNKGQTGEAIVAIS-KKLLRGSSSKNKSGSSTSKSVLM
        MM+HFNVAE  E VIDE SQ++                       Y LT+LLNELQT++SL+  KGQ GEA VA S +K  RGS+S NK   S+S +   
Subjt:  MMVHFNVAEENEVVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTSLLNELQTYQSLLTNKGQTGEAIVAIS-KKLLRGSSSKNKSGSSTSKSVLM

Query:  KKK----GKVKNKIPTNRNKVQKTDKGKCFHCNENGHWKRIYPKYLDEKKVEK---------TQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFID
        KKK    G   N      +K  K  KG CFHCN+ GHWKR  PKYL EKK  K         T +G+R K+ LEL+ ++L G MNVKARGG+EYFI+F D
Subjt:  KKK----GKVKNKIPTNRNKVQKTDKGKCFHCNENGHWKRIYPKYLDEKKVEK---------TQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFID

Query:  DYSRYGYLYLMRHKFEALEKFREYKTEAENLLAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETTTYILNMVPIKSVSETPYELWKGRKG
        DYSRYGY+YLM+HK EALEKF+EYK E EN L PG PQQNGVSERRNRTLLDMVRS+MS+ +LP+ FWGYAV+T  YILN VP KSVSETP +LW GRKG
Subjt:  DYSRYGYLYLMRHKFEALEKFREYKTEAENLLAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETTTYILNMVPIKSVSETPYELWKGRKG

Query:  ---------------------------------YPKEAKCGLFYDPQENRVF-DTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRV
                                         YPK  + G FYDP++N+VF  TNATFLEED++R H+PRSK+VL+E+SKE T+ +TRVV++    TRV
Subjt:  ---------------------------------YPKEAKCGLFYDPQENRVF-DTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRV

Query:  VDGADTSGQSHPSQELRMPRRSGRVITQPDRYLSLAETQVIIPDDGIEDPLTYKQ
        V    +  ++H  Q LR PRRSGRV   P RY+SL ET  +I D  IEDPLT+K+
Subjt:  VDGADTSGQSHPSQELRMPRRSGRVITQPDRYLSLAETQVIIPDDGIEDPLTYKQ

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa]3.9e-16255.69Show/hide
Query:  RFVLTEECPPNPSSNANRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKEGMFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLD
        RFVL EECP  P++NA RTVR+ Y+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+E MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+
Subjt:  RFVLTEECPPNPSSNANRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKEGMFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLD

Query:  MMVHFNVAEENEVVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTSLLNELQTYQSLLTNKGQTGEAIVAIS-KKLLRGSSSKNKSGSSTSKSVLM
        MMVHFNVAE N  VIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT+LLNELQT++SL+  KGQ GEA VA S +K  RGS+S  KS  S+S +   
Subjt:  MMVHFNVAEENEVVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTSLLNELQTYQSLLTNKGQTGEAIVAIS-KKLLRGSSSKNKSGSSTSKSVLM

Query:  KKK----GKVKNKIPTNRNKVQKTDKGKCFHCNENGHWKRIYPKYLDEKKVEK---------TQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFID
        KKK    G   N       K  K  KG CFHCN+ GHWKR  PKYL EKK  K         T +G++AK+ LEL+ +DL G MNVKARGG+EYFI+F D
Subjt:  KKK----GKVKNKIPTNRNKVQKTDKGKCFHCNENGHWKRIYPKYLDEKKVEK---------TQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFID

Query:  DYSRYGYLYLMRHKFEALEKFREYKTEAENLLA----------------------------------PGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDP
        DYSRYGY+YLM+HK EALEKF+EYK E EN L+                                  PG PQQNGVS+RRNRTLLDMVRSMMS+  LP+ 
Subjt:  DYSRYGYLYLMRHKFEALEKFREYKTEAENLLA----------------------------------PGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDP

Query:  FWGYAVETTTYILNMVPIKSVSETPYELWKGRK---------------------------------GYPKEAKCGLFYDPQENRVF-DTNATFLEEDHVR
        FWGYAV+T  YILN VP KSVS+TP +LW GRK                                 GYPK  + G FYDP++N+VF  TNATFLEEDH+R
Subjt:  FWGYAVETTTYILNMVPIKSVSETPYELWKGRK---------------------------------GYPKEAKCGLFYDPQENRVF-DTNATFLEEDHVR

Query:  NHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRYLSLAETQVIIPDDGIEDPLTYKQ
         H+PRSK+VL+E+SKE T+ +TRVV++    TRVV    +S ++H  Q LR PRRSGRV   P  Y+SL ET  +I D  IEDPLT+K+
Subjt:  NHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRYLSLAETQVIIPDDGIEDPLTYKQ

TrEMBL top hitse value%identityAlignment
A0A5A7SMN4 Gag/pol protein3.5e-16962.45Show/hide
Query:  RFVLTEECPPNPSSNANRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKEGMFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLD
        RFVL EECP   + NA RTVR+ YD W KAN+KAR YILAS+S VLAKKH++M TA EIM+SL+E MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+
Subjt:  RFVLTEECPPNPSSNANRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKEGMFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLD

Query:  MMVHFNVAEENEVVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTSLLNELQTYQSLLTNKGQTGEAIVAIS-KKLLRGSSSKNKSGSSTSKSVLM
        M+VHFNVAE N  VIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT+LLNELQT++SL+  KGQ GEA VA S +K  RGS+S  KS  S+S S   
Subjt:  MMVHFNVAEENEVVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTSLLNELQTYQSLLTNKGQTGEAIVAIS-KKLLRGSSSKNKSGSSTSKSVLM

Query:  KKK----GKVKNKIPTNRNKVQKTDKGKCFHCNENGHWKRIYPKYLDEKKVEK---------TQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFID
        KKK    G   N       K  K  KG CFHCN+ GHWKR  PKYL E K  K         T +G+RAK+ LEL+ +DL GLMNVKARGG+EYFI+F D
Subjt:  KKK----GKVKNKIPTNRNKVQKTDKGKCFHCNENGHWKRIYPKYLDEKKVEK---------TQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFID

Query:  DYSRYGYLYLMRHKFEALEKFREYKTEAENLLAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETTTYILNMVPIKSVSETPYELWKGRKG
        DYSRYGY+YLM+HK +ALEKF+EYK E EN L PG PQQNGVSERRNRTLLDMV SMMS+A LP+ FWGYAV+T  YILN VP KSVSETP +LW GRKG
Subjt:  DYSRYGYLYLMRHKFEALEKFREYKTEAENLLAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETTTYILNMVPIKSVSETPYELWKGRKG

Query:  YPKEAKCGLFYDPQENRVF-DTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRYL
        YPK  +   FYDP++N+VF  TNATFLE+DH+R H+P +K+VL+E+SKE T+ +TRVV++    TRVV    +S ++H  Q LR PRRSGRV   P RY+
Subjt:  YPKEAKCGLFYDPQENRVF-DTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRYL

Query:  SLAETQVIIPDDGIEDPLTYKQ
        SL ET  +I D  IEDPLT+K+
Subjt:  SLAETQVIIPDDGIEDPLTYKQ

A0A5A7SNP8 Gag/pol protein7.9e-16155.69Show/hide
Query:  RFVLTEECPPNPSSNANRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKEGMFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLD
        RFVL E+CP  P++NA RTVR+ Y+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+E MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+
Subjt:  RFVLTEECPPNPSSNANRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKEGMFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLD

Query:  MMVHFNVAEENEVVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTSLLNELQTYQSLLTNKGQTGEAIVAIS-KKLLRGSSSKNKSGSSTSKSVLM
        MMVHFNVA  NE VIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT+LLNELQT++SL+  KGQ GEA VA S +K  RGS+S  KS  S+S +   
Subjt:  MMVHFNVAEENEVVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTSLLNELQTYQSLLTNKGQTGEAIVAIS-KKLLRGSSSKNKSGSSTSKSVLM

Query:  KKK----GKVKNKIPTNRNKVQKTDKGKCFHCNENGHWKRIYPKYLDEKKVEK---------TQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFID
        KKK    G   N       K  K  KG CF CN+ GHWKR  PKYL +KK  K         T +G+RAK+ LEL+ +DL G MNVKARGG+EYFI+F D
Subjt:  KKK----GKVKNKIPTNRNKVQKTDKGKCFHCNENGHWKRIYPKYLDEKKVEK---------TQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFID

Query:  DYSRYGYLYLMRHKFEALEKFREYKTEAENLL----------------------------------APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDP
        DYSRYGY+YLM+HK EALEKF+EYK E EN L                                  AP  PQQNGVSERRNRTLLDMVRSMMS+A LP+ 
Subjt:  DYSRYGYLYLMRHKFEALEKFREYKTEAENLL----------------------------------APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDP

Query:  FWGYAVETTTYILNMVPIKSVSETPYELWKGRK---------------------------------GYPKEAKCGLFYDPQENRVF-DTNATFLEEDHVR
        FWGYAV+T  YILN VP KSVSETP +LW G K                                 GYPK  + G FYDP++N+VF  TNATFLEEDH+R
Subjt:  FWGYAVETTTYILNMVPIKSVSETPYELWKGRK---------------------------------GYPKEAKCGLFYDPQENRVF-DTNATFLEEDHVR

Query:  NHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRYLSLAETQVIIPDDGIEDPLTYKQ
         H+PRSK+VL+E+SKE T+ +TRVV++     RVV    +S ++H  Q LR PRRSGRV   P RY+SL ET  +I D  IEDPLT+K+
Subjt:  NHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRYLSLAETQVIIPDDGIEDPLTYKQ

A0A5A7T706 Gag/pol protein1.3e-15255.5Show/hide
Query:  RFVLTEECPPNPSSNANRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKEGMFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLD
        RFVL EECP   ++NA RTVR+AY+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+E MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+
Subjt:  RFVLTEECPPNPSSNANRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKEGMFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLD

Query:  MMVHFNVAEENEVVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTSLLNELQTYQSLLTNKGQTGEAIVAIS-KKLLRGSSSKNKSGSSTSKSVLM
        MM+HFNVAE  E VIDE SQ++                       Y LT+LLNELQT++SL+  KGQ GEA VA S +K  RGS+S NK   S+S +   
Subjt:  MMVHFNVAEENEVVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTSLLNELQTYQSLLTNKGQTGEAIVAIS-KKLLRGSSSKNKSGSSTSKSVLM

Query:  KKK----GKVKNKIPTNRNKVQKTDKGKCFHCNENGHWKRIYPKYLDEKKVEK---------TQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFID
        KKK    G   N      +K  K  KG CFHCN+ GHWKR  PKYL EKK  K         T +G+R K+ LEL+ ++L G MNVKARGG+EYFI+F D
Subjt:  KKK----GKVKNKIPTNRNKVQKTDKGKCFHCNENGHWKRIYPKYLDEKKVEK---------TQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFID

Query:  DYSRYGYLYLMRHKFEALEKFREYKTEAENLLAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETTTYILNMVPIKSVSETPYELWKGRKG
        DYSRYGY+YLM+HK EALEKF+EYK E EN L PG PQQNGVSERRNRTLLDMVRS+MS+ +LP+ FWGYAV+T  YILN VP KSVSETP +LW GRKG
Subjt:  DYSRYGYLYLMRHKFEALEKFREYKTEAENLLAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETTTYILNMVPIKSVSETPYELWKGRKG

Query:  ---------------------------------YPKEAKCGLFYDPQENRVF-DTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRV
                                         YPK  + G FYDP++N+VF  TNATFLEED++R H+PRSK+VL+E+SKE T+ +TRVV++    TRV
Subjt:  ---------------------------------YPKEAKCGLFYDPQENRVF-DTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRV

Query:  VDGADTSGQSHPSQELRMPRRSGRVITQPDRYLSLAETQVIIPDDGIEDPLTYKQ
        V    +  ++H  Q LR PRRSGRV   P RY+SL ET  +I D  IEDPLT+K+
Subjt:  VDGADTSGQSHPSQELRMPRRSGRVITQPDRYLSLAETQVIIPDDGIEDPLTYKQ

A0A5A7U869 Gag/pol protein1.9e-16255.69Show/hide
Query:  RFVLTEECPPNPSSNANRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKEGMFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLD
        RFVL EECP  P++NA RTVR+ Y+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+E MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+
Subjt:  RFVLTEECPPNPSSNANRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKEGMFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLD

Query:  MMVHFNVAEENEVVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTSLLNELQTYQSLLTNKGQTGEAIVAIS-KKLLRGSSSKNKSGSSTSKSVLM
        MMVHFNVAE N  VIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT+LLNELQT++SL+  KGQ GEA VA S +K  RGS+S  KS  S+S +   
Subjt:  MMVHFNVAEENEVVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTSLLNELQTYQSLLTNKGQTGEAIVAIS-KKLLRGSSSKNKSGSSTSKSVLM

Query:  KKK----GKVKNKIPTNRNKVQKTDKGKCFHCNENGHWKRIYPKYLDEKKVEK---------TQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFID
        KKK    G   N       K  K  KG CFHCN+ GHWKR  PKYL EKK  K         T +G++AK+ LEL+ +DL G MNVKARGG+EYFI+F D
Subjt:  KKK----GKVKNKIPTNRNKVQKTDKGKCFHCNENGHWKRIYPKYLDEKKVEK---------TQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFID

Query:  DYSRYGYLYLMRHKFEALEKFREYKTEAENLLA----------------------------------PGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDP
        DYSRYGY+YLM+HK EALEKF+EYK E EN L+                                  PG PQQNGVS+RRNRTLLDMVRSMMS+  LP+ 
Subjt:  DYSRYGYLYLMRHKFEALEKFREYKTEAENLLA----------------------------------PGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDP

Query:  FWGYAVETTTYILNMVPIKSVSETPYELWKGRK---------------------------------GYPKEAKCGLFYDPQENRVF-DTNATFLEEDHVR
        FWGYAV+T  YILN VP KSVS+TP +LW GRK                                 GYPK  + G FYDP++N+VF  TNATFLEEDH+R
Subjt:  FWGYAVETTTYILNMVPIKSVSETPYELWKGRK---------------------------------GYPKEAKCGLFYDPQENRVF-DTNATFLEEDHVR

Query:  NHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRYLSLAETQVIIPDDGIEDPLTYKQ
         H+PRSK+VL+E+SKE T+ +TRVV++    TRVV    +S ++H  Q LR PRRSGRV   P  Y+SL ET  +I D  IEDPLT+K+
Subjt:  NHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRYLSLAETQVIIPDDGIEDPLTYKQ

E2GK51 Gag/pol protein (Fragment)4.8e-15846.01Show/hide
Query:  RFVLTEECPPNPSSNANRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKEGMFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLD
        RFVLTEECP  P+ NANRTVR+AYDRW+KANDKARVYILAS++DVLAKKHD + TAK IM+SL+E MFGQPS+SLRHEAIK+IY  RMKEGTSVREHVLD
Subjt:  RFVLTEECPPNPSSNANRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKEGMFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLD

Query:  MMVHFNVAEENEVVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTSLLNELQTYQSLLTNKGQTGEAIVAISK-KLLRGSSSKNKSGSSTSKSVLM
        MM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TN  +NKIE+NLT+LLNELQ +Q+L  +KG+  EA VA++K K +RGSSSKNK G S ++   M
Subjt:  MMVHFNVAEENEVVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTSLLNELQTYQSLLTNKGQTGEAIVAISK-KLLRGSSSKNKSGSSTSKSVLM

Query:  KKKGKVKNKIPTNRNKVQK-TDKGKCFHCNENGHWKRIYPKYLDEKKVEK--------------------------------------------------
        KKKGK K     N +KV+K  DKGKCFHCN++GHWKR  PKYL EKK EK                                                  
Subjt:  KKKGKVKNKIPTNRNKVQK-TDKGKCFHCNENGHWKRIYPKYLDEKKVEK--------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------TQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFID
                                                                      T +G RAK  LEL+ +DL G MNVKARGGYEYFISFID
Subjt:  --------------------------------------------------------------TQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFID

Query:  DYSRYGYLYLMRHKFEALEKFREYKTEAEN----------------------------------LLAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDP
        D+SRYG++YL+ HK E+ EKF+EYK E EN                                  L AP  PQQNGVSERRNRTLLDMVRSMMS+AQLPD 
Subjt:  DYSRYGYLYLMRHKFEALEKFREYKTEAEN----------------------------------LLAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDP

Query:  FWGYAVETTTYILNMVPIKSVSETPYELWKGRK---------------------------------GYPKEAKCGLFYDPQENRVF-DTNATFLEEDHVR
        FWGYA+ET  +ILN VP KSV ETPYELWKGRK                                 GYPKE++ GLFY PQEN+VF  TNATFLEEDH R
Subjt:  FWGYAVETTTYILNMVPIKSVSETPYELWKGRK---------------------------------GYPKEAKCGLFYDPQENRVF-DTNATFLEEDHVR

Query:  NHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRYLSLAETQVIIPDDGIEDPLTYKQ
        NHQPRSK+VL E+ K ATDK +        ST+VVD A+ S QSH SQELR+PRRSGRV+ QP+RYL L ETQ+IIPDDG+EDPLTYKQ
Subjt:  NHQPRSKLVLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRYLSLAETQVIIPDDGIEDPLTYKQ

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.5e-1528Show/hide
Query:  NGHWKRIYPKYLDEKKVEKTQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFIDDYSRYGYLYLMRHKFEALEKFREYKTEAE--------------
        NG   R+  K L +K           K+ L ++ +D+ G +         YF+ F+D ++ Y   YL+++K +    F+++  ++E              
Subjt:  NGHWKRIYPKYLDEKKVEKTQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFIDDYSRYGYLYLMRHKFEALEKFREYKTEAE--------------

Query:  --------------------NLLAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETTTYILNMVPIKSV---SETPYELWKGRKGYPKEAK
                            +L  P  PQ NGVSER  RT+ +  R+M+S A+L   FWG AV T TY++N +P +++   S+TPYE+W  +K Y K  +
Subjt:  --------------------NLLAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETTTYILNMVPIKSV---SETPYELWKGRKGYPKEAK

P0C2J0 Transposon Ty1-PR2 Gag-Pol polyprotein4.5e-0425.64Show/hide
Query:  KTQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFIDDYSRYGYLY-------------------LMRHKFEA------LEKFREYKT-------EAE
        K Q  Y   Q L    TD++G ++   +    YFISF D+ +++ ++Y                    ++++F+A      +++  EY         E +
Subjt:  KTQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFIDDYSRYGYLY-------------------LMRHKFEA------LEKFREYKT-------EAE

Query:  NLLAP-----GMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETTTYILN
        N + P        + +GV+ER NRTLLD  R+ +  + LP+  W  A+E +T + N
Subjt:  NLLAP-----GMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETTTYILN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.5e-1725.37Show/hide
Query:  RAKQTLELMLTDLYGLMNVKARGGYEYFISFIDDYSRYGYLYLMRHKFEALEKFREYKTEAE----------------------------------NLLA
        R    L+L+ +D+ G M +++ GG +YF++FIDD SR  ++Y+++ K +  + F+++    E                                      
Subjt:  RAKQTLELMLTDLYGLMNVKARGGYEYFISFIDDYSRYGYLYLMRHKFEALEKFREYKTEAE----------------------------------NLLA

Query:  PGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETTTYILNMVP-IKSVSETPYELWKGRK---------------GYPKEAKCGL--------
        PG PQ NGV+ER NRT+++ VRSM+  A+LP  FWG AV+T  Y++N  P +    E P  +W  ++                 PKE +  L        
Subjt:  PGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETTTYILNMVP-IKSVSETPYELWKGRK---------------GYPKEAKCGL--------

Query:  ------------FYDPQENRVFDTNATFLEEDHVRNHQPRSKLVLSEI----------------SKEATDKTTRVVDQAGPSTRVVDGADTSGQ--SHPS
                     +DP + +V  +      E  VR     S+ V + I                ++  TD+ +   +Q G      +  D   +   HP+
Subjt:  ------------FYDPQENRVFDTNATFLEEDHVRNHQPRSKLVLSEI----------------SKEATDKTTRVVDQAGPSTRVVDGADTSGQ--SHPS

Query:  Q--ELRMP-RRSGRVITQPDRYLSLAETQVIIPDD
        Q  E   P RRS R   +  RY S     V+I DD
Subjt:  Q--ELRMP-RRSGRVITQPDRYLSLAETQVIIPDD

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.3e-1023.21Show/hide
Query:  KVEKTQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFIDDYSRYGYLYLMRHKFEALEKFREYKTEAENLL--------------------------
        KV  +Q    + + LE + +D++    + +   Y Y++ F+D ++RY +LY ++ K +  E F  +K   EN                            
Subjt:  KVEKTQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFIDDYSRYGYLYLMRHKFEALEKFREYKTEAENLL--------------------------

Query:  ------APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETTTYILNMVPIKSVS-ETPYE
               P  P+ NG+SER++R +++   +++S A +P  +W YA     Y++N +P   +  E+P++
Subjt:  ------APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETTTYILNMVPIKSVS-ETPYE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.3e-0822.93Show/hide
Query:  KVEKTQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFIDDYSRYGYLYLMRHKFEALEKFREYKTEAENLL--------------------------
        KV  +     + + LE + +D++    + +   Y Y++ F+D ++RY +LY ++ K +  + F  +K+  EN                            
Subjt:  KVEKTQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFIDDYSRYGYLYLMRHKFEALEKFREYKTEAENLL--------------------------

Query:  ------APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETTTYILNMVP
               P  P+ NG+SER++R +++M  +++S A +P  +W YA     Y++N +P
Subjt:  ------APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETTTYILNMVP

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCGACCTTGTGTCGCCCTGGGAGCGTCCTCCATTCGGAAGGTTTGTTTTAACTGAGGAATGTCCTCCAAACCCCAGCTCAAATGCAAACCGAACAGTTCGAGATGC
GTATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTCTAGCCAGCATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTA
TGGAATCTCTAAAAGAAGGGATGTTTGGACAACCGTCCTTCTCCCTTAGACATGAAGCCATAAAATACATTTACAACTGTCGTATGAAAGAAGGGACCTCAGTTAGAGAA
CATGTCCTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGTTGTCATTGATGAGAAGAGTCAAGTTAGTTTTATCATGATGTCTCTTCCGAAGAGCTTCTT
CCAGTTCCGCACAAATGTGGTAATGAACAAAATAGAATATAACTTGACTTCTCTTCTCAATGAGCTACAGACTTATCAGTCCCTCTTAACGAACAAGGGACAAACAGGAG
AAGCAATTGTTGCTATCTCCAAGAAATTACTACGAGGATCGTCCTCCAAAAATAAGTCTGGATCTTCAACTTCTAAAAGTGTTTTGATGAAGAAGAAGGGTAAAGTGAAA
AATAAGATTCCTACTAATCGCAACAAGGTTCAAAAAACAGATAAAGGAAAATGTTTCCATTGCAACGAAAACGGGCACTGGAAGAGAATCTACCCAAAATACCTTGATGA
GAAGAAAGTTGAAAAGACACAACAAGGTTATAGAGCTAAACAAACCTTAGAACTCATGCTTACAGATCTCTATGGTCTAATGAATGTCAAAGCACGAGGAGGGTATGAAT
ATTTCATCAGTTTTATTGATGATTATTCAAGGTATGGCTATCTTTACCTAATGCGTCACAAGTTCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTGAGGCTGAGAAT
CTATTAGCCCCTGGTATGCCTCAACAAAATGGTGTATCAGAAAGGAGAAATAGAACTTTGTTAGATATGGTTCGGTCTATGATGAGTTTTGCTCAATTACCCGATCCGTT
TTGGGGATATGCAGTGGAGACTACTACATACATTTTGAACATGGTTCCTATTAAGAGTGTTTCAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGGATACCCCAAAG
AGGCGAAATGTGGTTTGTTTTATGACCCTCAAGAAAATAGAGTGTTTGATACAAACGCTACATTTTTAGAGGAAGACCACGTAAGAAATCATCAACCTCGTAGCAAACTA
GTATTAAGTGAGATTTCTAAAGAAGCTACTGATAAGACAACAAGAGTTGTTGATCAAGCTGGTCCTTCAACAAGAGTTGTTGATGGAGCTGACACTTCTGGTCAATCACA
TCCTTCTCAAGAGTTAAGAATGCCTCGACGTAGTGGGAGGGTTATAACTCAACCCGATCGTTACTTGAGTTTGGCAGAAACTCAAGTCATCATACCTGATGATGGCATTG
AGGATCCATTGACCTATAAACAGCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCTCGACCTTGTGTCGCCCTGGGAGCGTCCTCCATTCGGAAGGTTTGTTTTAACTGAGGAATGTCCTCCAAACCCCAGCTCAAATGCAAACCGAACAGTTCGAGATGC
GTATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTCTAGCCAGCATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTA
TGGAATCTCTAAAAGAAGGGATGTTTGGACAACCGTCCTTCTCCCTTAGACATGAAGCCATAAAATACATTTACAACTGTCGTATGAAAGAAGGGACCTCAGTTAGAGAA
CATGTCCTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGTTGTCATTGATGAGAAGAGTCAAGTTAGTTTTATCATGATGTCTCTTCCGAAGAGCTTCTT
CCAGTTCCGCACAAATGTGGTAATGAACAAAATAGAATATAACTTGACTTCTCTTCTCAATGAGCTACAGACTTATCAGTCCCTCTTAACGAACAAGGGACAAACAGGAG
AAGCAATTGTTGCTATCTCCAAGAAATTACTACGAGGATCGTCCTCCAAAAATAAGTCTGGATCTTCAACTTCTAAAAGTGTTTTGATGAAGAAGAAGGGTAAAGTGAAA
AATAAGATTCCTACTAATCGCAACAAGGTTCAAAAAACAGATAAAGGAAAATGTTTCCATTGCAACGAAAACGGGCACTGGAAGAGAATCTACCCAAAATACCTTGATGA
GAAGAAAGTTGAAAAGACACAACAAGGTTATAGAGCTAAACAAACCTTAGAACTCATGCTTACAGATCTCTATGGTCTAATGAATGTCAAAGCACGAGGAGGGTATGAAT
ATTTCATCAGTTTTATTGATGATTATTCAAGGTATGGCTATCTTTACCTAATGCGTCACAAGTTCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTGAGGCTGAGAAT
CTATTAGCCCCTGGTATGCCTCAACAAAATGGTGTATCAGAAAGGAGAAATAGAACTTTGTTAGATATGGTTCGGTCTATGATGAGTTTTGCTCAATTACCCGATCCGTT
TTGGGGATATGCAGTGGAGACTACTACATACATTTTGAACATGGTTCCTATTAAGAGTGTTTCAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGGATACCCCAAAG
AGGCGAAATGTGGTTTGTTTTATGACCCTCAAGAAAATAGAGTGTTTGATACAAACGCTACATTTTTAGAGGAAGACCACGTAAGAAATCATCAACCTCGTAGCAAACTA
GTATTAAGTGAGATTTCTAAAGAAGCTACTGATAAGACAACAAGAGTTGTTGATCAAGCTGGTCCTTCAACAAGAGTTGTTGATGGAGCTGACACTTCTGGTCAATCACA
TCCTTCTCAAGAGTTAAGAATGCCTCGACGTAGTGGGAGGGTTATAACTCAACCCGATCGTTACTTGAGTTTGGCAGAAACTCAAGTCATCATACCTGATGATGGCATTG
AGGATCCATTGACCTATAAACAGCAATGA
Protein sequenceShow/hide protein sequence
MLDLVSPWERPPFGRFVLTEECPPNPSSNANRTVRDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKEGMFGQPSFSLRHEAIKYIYNCRMKEGTSVRE
HVLDMMVHFNVAEENEVVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTSLLNELQTYQSLLTNKGQTGEAIVAISKKLLRGSSSKNKSGSSTSKSVLMKKKGKVK
NKIPTNRNKVQKTDKGKCFHCNENGHWKRIYPKYLDEKKVEKTQQGYRAKQTLELMLTDLYGLMNVKARGGYEYFISFIDDYSRYGYLYLMRHKFEALEKFREYKTEAEN
LLAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETTTYILNMVPIKSVSETPYELWKGRKGYPKEAKCGLFYDPQENRVFDTNATFLEEDHVRNHQPRSKL
VLSEISKEATDKTTRVVDQAGPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRYLSLAETQVIIPDDGIEDPLTYKQQ