CuGenDBv2

CmoCh04G019810 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh04G019810
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Gag/pol protein
Genome location: Cmo_Chr04:10220333..10222786
RNA-Seq Expression: CmoCh04G019810
Synteny: CmoCh04G019810
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
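
The genome location field follows a `SEQID:START..END` pattern. The Python sketch below shows one way to split it and compute the span length. The function name `parse_location` is hypothetical, and the `+ 1` assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("Cmo_Chr04:10220333..10222786")
# Assumption: 1-based, inclusive coordinates, so both endpoints count.
span = end - start + 1
print(seqid, start, end, span)  # Cmo_Chr04 10220333 10222786 2454
```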


Homology
GenBank top hits (hit; e value; % identity; alignment)
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]; e value: 4.9e-211; % identity: 52.81
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        M  SIVQLLASEKLNGDNY+ WKSNLNTILV+DDL+FVLTEECP  P  NANRT R+AYDRW+KANDKARVYILAS++DVLAKKHD + TAK IM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNMVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQPS+SLRHEAIK+IY  RMKEGTSVREHVLDMM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TN  +NKIE+NLT LLNELQ +Q+L  +KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNMVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK----------------
        +  EANVA++K K +RGSSS+NK GPS ++   MKKKGKG  K P   K K + ADKGKCFHCN++GHWKRNCPKYLAEKKAEK                
Subjt:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------------------------TQQG
                                                                                                        T +G
Subjt:  ------------------------------------------------------------------------------------------------TQQG

Query:  YRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS
         RAK  LELVH+DLCGPMNVKARG YEYFISFIDD+SRYG++YL+HHKSE+ EKF+EYK EV+N +GKTIKTLRSDRGGEYMD +FQDY+IE GI+SQLS
Subjt:  YRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS

Query:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK---------------------------------
        AP  PQQNGVSERRNRTLLDMVRSMMS+AQLPD FWGYA+ETA +ILN VP+KSVLETPYELWKGRK                                 
Subjt:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK---------------------------------

Query:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY
        GYPKE++GGLFY PQEN+VFVSTNATFLEEDH RNHQPRSK+VL E+ K ATDK        S ST+VVD A+ S QSH SQELR+PRRSGRV+ QP+RY
Subjt:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY

Query:  LGLAETQVIIPDDGVEDP
        LGL ETQ+IIPDDGVEDP
Subjt:  LGLAETQVIIPDDGVEDP
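
In BLAST-style alignments such as the ones on this page, the unlabeled middle line marks each column: a letter means the two residues are identical, `+` means a conservative substitution, and a space means a mismatch or gap. The Python sketch below counts these for the final block of the alignment above. The helper name `midline_stats` is hypothetical, and it assumes the midline is padded to the full block width. The reported % identity is presumably computed over the whole alignment, so a single block will not reproduce it.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment midline."""
    identities = sum(1 for ch in midline if ch.isalpha())
    # Positives include identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# Midline of the last block (query residues LGLAETQVIIPDDGVEDP).
midline = "LGL ETQ+IIPDDGVEDP"
identities, positives, columns = midline_stats(midline)
print(f"{identities}/{columns} identical, {positives}/{columns} positive")
# 16/18 identical, 17/18 positive
```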

KAA0026233.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 1.2e-204; % identity: 61.49
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        ++++ + +LA++KLNG+NY +WK+ +N +L+IDDL+FVL E+CP  P +NA RT R+ Y+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNMVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+MMVHFNVA  NEAVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNMVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK---------TQQG
        Q GEANVA S +K  RGS+S  KS PS+S +   KKK  G+G K  +   +  K  KA KG CF CN+ GHWKRNCPKYLA+KK  K         T +G
Subjt:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK---------TQQG

Query:  YRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS
        +RAKE LELVH+DLCGPMNVKARG +EYFI+F DDYSRYGY+YLM HKSEALEKF+EYK EV+N L KTIKT RSDRGGEYMDL+FQ+Y++E GI SQLS
Subjt:  YRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS

Query:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK---------------------------------
        AP  PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSV ETP +LW G K                                 
Subjt:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK---------------------------------

Query:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY
        GYPK T+GG FYDP++N+VFVSTNATFLEEDH+R H+PRSK+VL+E+SKE T+ +TRVV++ S   RVV    +S ++H  Q LR PRRSGRV   P RY
Subjt:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY

Query:  LGLAETQVIIPDDGVEDP
        + L ET  +I D  +EDP
Subjt:  LGLAETQVIIPDDGVEDP

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 1.1e-207; % identity: 61.97
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        MT++ + +LA++KLNG+NY +WK+ +NT+L+IDDL+FVL EECP  P +NA RT R+ Y+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNMVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+MMVHFNVAE N AVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNMVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK---------TQQG
        Q GEANVA S +K  RGS+S  KS PS+S +   KKK  G+G K  +   +  K  KA KG CFHCN+ GHWKRNCPKYLAEKK  K         T +G
Subjt:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK---------TQQG

Query:  YRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS
        ++AKE LELVH+DLCGPMNVKARG +EYFI+F DDYSRYGY+YLM HKSEALEKF+EYK EV+N L KTIKT RSDRGGEYMDL+FQ+Y++E GI SQLS
Subjt:  YRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS

Query:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK---------------------------------
         PG PQQNGVS+RRNRTLLDMVRSMMS+  LP+ FWGYAV+TA YILN VP+KSV +TP +LW GRK                                 
Subjt:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK---------------------------------

Query:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY
        GYPK T+GG FYDP++N+VFVSTNATFLEEDH+R H+PRSK+VL+E+SKE T+ +TRVV++ S  TRVV    +S ++H  Q LR PRRSGRV   P  Y
Subjt:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY

Query:  LGLAETQVIIPDDGVEDP
        + L ET  +I D  +EDP
Subjt:  LGLAETQVIIPDDGVEDP

KAA0054309.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 9.1e-189; % identity: 58.41
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        MT++ + +LA++KLNG+NY +WK+ +NT+L+IDDL+FVL EECP  P +NA RT R+ Y+RW K N+KAR YILAS+S+VLAKKH+ M TA+EIM SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNMVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ S+ ++H+A+KYI N RM EG SVREHV++MMVHFNVAE N AVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNMVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------
        Q GE NVA S +K  RGS+S  KS PS+S +   KKK  G+G K  +   +  K  KA KG CFHCN+ GHWKRNCPKYLAEKK  K             
Subjt:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------

Query:  -----------------------------TQQGYRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLG
                                     T++G+RAKE LELVH+DLCGPMNVKARG +EYFI+F DDYSRYGY+YLM HKSEALEKF+EYK EV+  L 
Subjt:  -----------------------------TQQGYRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLG

Query:  KTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK
        KTIKT RSDRGGEYMDL+FQ+Y++E GI SQLSAPG PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSV ETP +LW GRK
Subjt:  KTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK

Query:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY
                                    EEDH+R H PRSK+VL+E+SK  T+ +TRVV++ S  TRVV    +S ++H  Q LR PRRSGRV   P RY
Subjt:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY

Query:  LGLAETQVIIPDDGVEDP
        + L ET  +I D  +EDP
Subjt:  LGLAETQVIIPDDGVEDP

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 6.5e-187; % identity: 47.38
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        MT++ + +LA++KLNG+NY +WK+ +NT+L+IDDL+FVL EECP  P +NA RT R+ Y+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNMVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+MMVHFNVAE N AVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNMVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------
        Q GEANVA S +K  RGS+S  KS PS+S +   KKK  G+G K  +   +  K  KA KG CFHCN+ GHWKRNCPKYLAEKK  K             
Subjt:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------T
                                                                                                           T
Subjt:  ---------------------------------------------------------------------------------------------------T

Query:  QQGYRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS
         +G+RAKE LELVH+DLCGPMNVKARG +EYFI+F DDYSRYGY+YLM HKSEALEKF+EYK EV+N L KTIKT RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  QQGYRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS

Query:  QLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK------------------------------
        QLSAPG PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSV ETP +LW GRK                              
Subjt:  QLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK------------------------------

Query:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQP
           GYPK T+GG FYDP++N+VFVSTNATFLEEDH+R H+PRSK+VL+E+SKE T+ +TRVV++ S  TRVV    +S ++H  Q LR PRRSGRV   P
Subjt:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQP

Query:  DRYLGLAETQVIIPDDGVEDP
         RY+ L ET  +I D  +EDP
Subjt:  DRYLGLAETQVIIPDDGVEDP

TrEMBL top hits (hit; e value; % identity; alignment)
A0A5A7SMH8 Gag/pol protein; e value: 3.1e-187; % identity: 47.38
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        MT++ + +LA++KLNG+NY +WK+ +NT+L+IDDL+FVL EECP  P +NA RT R+ Y+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNMVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+MMVHFNVAE N AVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNMVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------
        Q GEANVA S +K  RGS+S  KS PS+S +   KKK  G+G K  +   +  K  KA KG CFHCN+ GHWKRNCPKYLAEKK  K             
Subjt:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------T
                                                                                                           T
Subjt:  ---------------------------------------------------------------------------------------------------T

Query:  QQGYRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS
         +G+RAKE LELVH+DLCGPMNVKARG +EYFI+F DDYSRYGY+YLM HKSEALEKF+EYK EV+N L KTIKT RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  QQGYRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS

Query:  QLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK------------------------------
        QLSAPG PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSV ETP +LW GRK                              
Subjt:  QLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK------------------------------

Query:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQP
           GYPK T+GG FYDP++N+VFVSTNATFLEEDH+R H+PRSK+VL+E+SKE T+ +TRVV++ S  TRVV    +S ++H  Q LR PRRSGRV   P
Subjt:  ---GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQP

Query:  DRYLGLAETQVIIPDDGVEDP
         RY+ L ET  +I D  +EDP
Subjt:  DRYLGLAETQVIIPDDGVEDP

A0A5A7SNP8 Gag/pol protein; e value: 5.7e-205; % identity: 61.49
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        ++++ + +LA++KLNG+NY +WK+ +N +L+IDDL+FVL E+CP  P +NA RT R+ Y+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNMVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+MMVHFNVA  NEAVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNMVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK---------TQQG
        Q GEANVA S +K  RGS+S  KS PS+S +   KKK  G+G K  +   +  K  KA KG CF CN+ GHWKRNCPKYLA+KK  K         T +G
Subjt:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK---------TQQG

Query:  YRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS
        +RAKE LELVH+DLCGPMNVKARG +EYFI+F DDYSRYGY+YLM HKSEALEKF+EYK EV+N L KTIKT RSDRGGEYMDL+FQ+Y++E GI SQLS
Subjt:  YRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS

Query:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK---------------------------------
        AP  PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSV ETP +LW G K                                 
Subjt:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK---------------------------------

Query:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY
        GYPK T+GG FYDP++N+VFVSTNATFLEEDH+R H+PRSK+VL+E+SKE T+ +TRVV++ S   RVV    +S ++H  Q LR PRRSGRV   P RY
Subjt:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY

Query:  LGLAETQVIIPDDGVEDP
        + L ET  +I D  +EDP
Subjt:  LGLAETQVIIPDDGVEDP

A0A5A7U869 Gag/pol protein; e value: 5.5e-208; % identity: 61.97
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        MT++ + +LA++KLNG+NY +WK+ +NT+L+IDDL+FVL EECP  P +NA RT R+ Y+RW KAN+KAR YILAS+S+VLAKKH+ M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNMVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ S+ ++H+A+KYIYN RM EG SVREHVL+MMVHFNVAE N AVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNMVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK---------TQQG
        Q GEANVA S +K  RGS+S  KS PS+S +   KKK  G+G K  +   +  K  KA KG CFHCN+ GHWKRNCPKYLAEKK  K         T +G
Subjt:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK---------TQQG

Query:  YRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS
        ++AKE LELVH+DLCGPMNVKARG +EYFI+F DDYSRYGY+YLM HKSEALEKF+EYK EV+N L KTIKT RSDRGGEYMDL+FQ+Y++E GI SQLS
Subjt:  YRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS

Query:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK---------------------------------
         PG PQQNGVS+RRNRTLLDMVRSMMS+  LP+ FWGYAV+TA YILN VP+KSV +TP +LW GRK                                 
Subjt:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK---------------------------------

Query:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY
        GYPK T+GG FYDP++N+VFVSTNATFLEEDH+R H+PRSK+VL+E+SKE T+ +TRVV++ S  TRVV    +S ++H  Q LR PRRSGRV   P  Y
Subjt:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY

Query:  LGLAETQVIIPDDGVEDP
        + L ET  +I D  +EDP
Subjt:  LGLAETQVIIPDDGVEDP

A0A5A7ULH1 Gag/pol protein; e value: 4.4e-189; % identity: 58.41
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        MT++ + +LA++KLNG+NY +WK+ +NT+L+IDDL+FVL EECP  P +NA RT R+ Y+RW K N+KAR YILAS+S+VLAKKH+ M TA+EIM SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNMVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ S+ ++H+A+KYI N RM EG SVREHV++MMVHFNVAE N AVIDE SQVSFI+ SLP+SF QFR+N VMNKI Y LT LLNELQT++SL+  KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNMVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------
        Q GE NVA S +K  RGS+S  KS PS+S +   KKK  G+G K  +   +  K  KA KG CFHCN+ GHWKRNCPKYLAEKK  K             
Subjt:  QTGEANVAIS-KKLLRGSSSQNKSGPSTSKSVLMKKK--GKG-KNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-------------

Query:  -----------------------------TQQGYRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLG
                                     T++G+RAKE LELVH+DLCGPMNVKARG +EYFI+F DDYSRYGY+YLM HKSEALEKF+EYK EV+  L 
Subjt:  -----------------------------TQQGYRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLG

Query:  KTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK
        KTIKT RSDRGGEYMDL+FQ+Y++E GI SQLSAPG PQQNGVSERRNRTLLDMVRSMMS+A LP+ FWGYAV+TA YILN VP+KSV ETP +LW GRK
Subjt:  KTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK

Query:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY
                                    EEDH+R H PRSK+VL+E+SK  T+ +TRVV++ S  TRVV    +S ++H  Q LR PRRSGRV   P RY
Subjt:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY

Query:  LGLAETQVIIPDDGVEDP
        + L ET  +I D  +EDP
Subjt:  LGLAETQVIIPDDGVEDP

E2GK51 Gag/pol protein (Fragment); e value: 2.4e-211; % identity: 52.81
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        M  SIVQLLASEKLNGDNY+ WKSNLNTILV+DDL+FVLTEECP  P  NANRT R+AYDRW+KANDKARVYILAS++DVLAKKHD + TAK IM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNMVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQPS+SLRHEAIK+IY  RMKEGTSVREHVLDMM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TN  +NKIE+NLT LLNELQ +Q+L  +KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNMVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK----------------
        +  EANVA++K K +RGSSS+NK GPS ++   MKKKGKG  K P   K K + ADKGKCFHCN++GHWKRNCPKYLAEKKAEK                
Subjt:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------------------------TQQG
                                                                                                        T +G
Subjt:  ------------------------------------------------------------------------------------------------TQQG

Query:  YRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS
         RAK  LELVH+DLCGPMNVKARG YEYFISFIDD+SRYG++YL+HHKSE+ EKF+EYK EV+N +GKTIKTLRSDRGGEYMD +FQDY+IE GI+SQLS
Subjt:  YRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLS

Query:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK---------------------------------
        AP  PQQNGVSERRNRTLLDMVRSMMS+AQLPD FWGYA+ETA +ILN VP+KSVLETPYELWKGRK                                 
Subjt:  APGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLETPYELWKGRK---------------------------------

Query:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY
        GYPKE++GGLFY PQEN+VFVSTNATFLEEDH RNHQPRSK+VL E+ K ATDK        S ST+VVD A+ S QSH SQELR+PRRSGRV+ QP+RY
Subjt:  GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQPDRY

Query:  LGLAETQVIIPDDGVEDP
        LGL ETQ+IIPDDGVEDP
Subjt:  LGLAETQVIIPDDGVEDP

SwissProt top hits (hit; e value; % identity; alignment)
P04146 Copia protein; e value: 6.4e-28; % identity: 35.06
Query:  KETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPG
        K  L +VH+D+CGP+         YF+ F+D ++ Y   YL+ +KS+    F+++  + +      +  L  D G EY+    + + ++ GI   L+ P 
Subjt:  KETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPG

Query:  MPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLE---TPYELWKGRKGYPKETK
         PQ NGVSER  RT+ +  R+M+S A+L   FWG AV TATY++N +P++++++   TPYE+W  +K Y K  +
Subjt:  MPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSVLE---TPYELWKGRKGYPKETK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 6.4e-36; % identity: 31.25
Query:  RAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSA
        R    L+LV++D+CGPM +++ G  +YF++FIDD SR  ++Y++  K +  + F+++   V+   G+ +K LRSD GGEY    F++Y   HGIR + + 
Subjt:  RAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSA

Query:  PGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSV-LETPYELWKGRK---------------------------------
        PG PQ NGV+ER NRT+++ VRSM+  A+LP  FWG AV+TA Y++N  P+  +  E P  +W  ++                                 
Subjt:  PGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSV-LETPYELWKGRK---------------------------------

Query:  --GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEI-------------SKEATDKTTRVVDQASPSTRVVDGADTSGQ-----SHP
          GY  E  G   +DP + +V  S +  F  E  VR     S+ V + I                A   T  V +Q      V++  +   +      HP
Subjt:  --GYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEI-------------SKEATDKTTRVVDQASPSTRVVDGADTSGQ-----SHP

Query:  SQ--ELRMP-RRSGRVITQPDRYLGLAETQVIIPDD
        +Q  E   P RRS R   +  RY   +   V+I DD
Subjt:  SQ--ELRMP-RRSGRVITQPDRYLGLAETQVIIPDD

Q12490 Transposon Ty1-BL Gag-Pol polyprotein; e value: 2.1e-15; % identity: 29.07
Query:  CPKYLAEKKAEKTQ-QGYRAK-----ETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSE--ALEKFREYKTEVQNLLGKTIKTLRSD
        CP  L  K  +    +G R K     E  + +HTD+ GP++   +    YFISF D+ +++ ++Y +H + E   L+ F      ++N    ++  ++ D
Subjt:  CPKYLAEKKAEKTQ-QGYRAK-----ETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSE--ALEKFREYKTEVQNLLGKTIKTLRSD

Query:  RGGEYMDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILN
        RG EY +     ++ ++GI    +     + +GV+ER NRTLLD  R+ +  + LP+  W  A+E +T + N
Subjt:  RGGEYMDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILN

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value: 3.4e-21; % identity: 31.49
Query:  NCPKYLAEK--KAEKTQQGYRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEY
        +C   L  K  K   +Q    +   LE +++D+     + +   Y Y++ F+D ++RY +LY +  KS+  E F  +K  ++N     I T  SD GGE+
Subjt:  NCPKYLAEK--KAEKTQQGYRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEY

Query:  MDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSV-LETPYE
        + L   +Y  +HGI    S P  P+ NG+SER++R +++   +++S A +P  +W YA   A Y++N +PT  + LE+P++
Subjt:  MDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTKSV-LETPYE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value: 3.1e-22; % identity: 37.96
Query:  YEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNRTLLDMVRSM
        Y Y++ F+D ++RY +LY +  KS+  + F  +K+ V+N     I TL SD GGE++ LR  DY+ +HGI    S P  P+ NG+SER++R +++M  ++
Subjt:  YEYFISFIDDYSRYGYLYLMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNRTLLDMVRSM

Query:  MSFAQLPDPFWGYAVETATYILNMVPTKSV-LETPYE
        +S A +P  +W YA   A Y++N +PT  + L++P++
Subjt:  MSFAQLPDPFWGYAVETATYILNMVPTKSV-LETPYE

Arabidopsis top hits (hit; e value; % identity; alignment)
No hits found

Sequences
CDS sequence
ATGACAAACTCAATAGTACAATTACTCGCTTCTGAGAAATTAAACGGCGACAACTACACAACTTGGAAATCAAACCTAAACACAATACTGGTCATTGATGATTTAAAGTT
TGTTTTAACTGAGGAGTGTCCTCCAAACCCAAACTCAAATGCAAACCGAACAGCTCGGGATGCATATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTC
TAGCCAGCATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCCTTCTCCCTTAGA
CATGAAGCCATAAAATACATTTACAATTGCCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGC
TGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGATGTCTCTTCCGAAGAGCTTCTTCCAGTTCCGCACCAATATGGTTATGAACAAAATAGAATATAACTTGACTG
CTCTTCTCAATGAGCTACAAACTTATCAGTCCCTCTTAACAAACAAGGGACAAACAGGAGAAGCAAATGTTGCCATCTCCAAGAAATTACTACGAGGATCGTCCTCCCAA
AATAAGTCTGGACCTTCAACTTCTAAAAGTGTTTTGATGAAGAAGAAGGGAAAAGGGAAAAATAAGATTCCTACTAACCGCAAAAACAAGGTTCAAAAAGCAGATAAAGG
AAAATGTTTCCATTGCAACGAAAACGGGCACTGGAAGAGAAATTGCCCAAAATACCTTGCAGAGAAGAAAGCCGAAAAGACACAACAAGGTTATAGAGCCAAAGAAACCT
TAGAACTCGTGCATACAGATCTCTGTGGTCCAATGAATGTCAAAGCACGAGGAGTGTATGAATATTTCATCAGTTTTATTGATGATTATTCAAGGTATGGCTATCTTTAC
CTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTGAGGTTCAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGAGGAGA
GTACATGGATTTAAGATTTCAGGACTATATGATAGAACATGGAATAAGGTCTCAACTCTCAGCCCCTGGTATGCCTCAACAAAATGGTGTATCAGAAAGGAGAAATAGAA
CTCTATTAGACATGGTTCGGTCTATGATGAGTTTCGCTCAATTACCCGATCCATTTTGGGGATATGCAGTGGAGACTGCTACATACATTTTGAACATGGTTCCTACTAAG
AGTGTTTTAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGGATACCCCAAAGAGACGAAAGGTGGTTTGTTTTATGATCCTCAAGAAAATAGAGTGTTTGTATCAAC
GAACGCTACATTCTTAGAGGAAGACCACGTAAGAAATCATCAACCTCGTAGCAAACTAGTATTAAGTGAGATTTCTAAAGAAGCTACTGATAAAACAACAAGAGTTGTTG
ATCAAGCTAGTCCTTCAACCAGAGTTGTTGATGGAGCTGACACTTCTGGTCAATCACATCCTTCTCAAGAGTTGAGAATGCCTCGACGTAGTGGGAGGGTTATAACTCAA
CCCGATCGTTACTTGGGTTTGGCAGAAACTCAAGTCATCATACCTGATGATGGCGTTGAGGATCCATGA
mRNA sequence
ATGACAAACTCAATAGTACAATTACTCGCTTCTGAGAAATTAAACGGCGACAACTACACAACTTGGAAATCAAACCTAAACACAATACTGGTCATTGATGATTTAAAGTT
TGTTTTAACTGAGGAGTGTCCTCCAAACCCAAACTCAAATGCAAACCGAACAGCTCGGGATGCATATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTC
TAGCCAGCATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCCTTCTCCCTTAGA
CATGAAGCCATAAAATACATTTACAATTGCCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGC
TGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGATGTCTCTTCCGAAGAGCTTCTTCCAGTTCCGCACCAATATGGTTATGAACAAAATAGAATATAACTTGACTG
CTCTTCTCAATGAGCTACAAACTTATCAGTCCCTCTTAACAAACAAGGGACAAACAGGAGAAGCAAATGTTGCCATCTCCAAGAAATTACTACGAGGATCGTCCTCCCAA
AATAAGTCTGGACCTTCAACTTCTAAAAGTGTTTTGATGAAGAAGAAGGGAAAAGGGAAAAATAAGATTCCTACTAACCGCAAAAACAAGGTTCAAAAAGCAGATAAAGG
AAAATGTTTCCATTGCAACGAAAACGGGCACTGGAAGAGAAATTGCCCAAAATACCTTGCAGAGAAGAAAGCCGAAAAGACACAACAAGGTTATAGAGCCAAAGAAACCT
TAGAACTCGTGCATACAGATCTCTGTGGTCCAATGAATGTCAAAGCACGAGGAGTGTATGAATATTTCATCAGTTTTATTGATGATTATTCAAGGTATGGCTATCTTTAC
CTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTGAGGTTCAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGAGGAGA
GTACATGGATTTAAGATTTCAGGACTATATGATAGAACATGGAATAAGGTCTCAACTCTCAGCCCCTGGTATGCCTCAACAAAATGGTGTATCAGAAAGGAGAAATAGAA
CTCTATTAGACATGGTTCGGTCTATGATGAGTTTCGCTCAATTACCCGATCCATTTTGGGGATATGCAGTGGAGACTGCTACATACATTTTGAACATGGTTCCTACTAAG
AGTGTTTTAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGGATACCCCAAAGAGACGAAAGGTGGTTTGTTTTATGATCCTCAAGAAAATAGAGTGTTTGTATCAAC
GAACGCTACATTCTTAGAGGAAGACCACGTAAGAAATCATCAACCTCGTAGCAAACTAGTATTAAGTGAGATTTCTAAAGAAGCTACTGATAAAACAACAAGAGTTGTTG
ATCAAGCTAGTCCTTCAACCAGAGTTGTTGATGGAGCTGACACTTCTGGTCAATCACATCCTTCTCAAGAGTTGAGAATGCCTCGACGTAGTGGGAGGGTTATAACTCAA
CCCGATCGTTACTTGGGTTTGGCAGAAACTCAAGTCATCATACCTGATGATGGCGTTGAGGATCCATGA
Protein sequence
MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMFGQPSFSLR
HEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNMVMNKIEYNLTALLNELQTYQSLLTNKGQTGEANVAISKKLLRGSSSQ
NKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQGYRAKETLELVHTDLCGPMNVKARGVYEYFISFIDDYSRYGYLY
LMHHKSEALEKFREYKTEVQNLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPGMPQQNGVSERRNRTLLDMVRSMMSFAQLPDPFWGYAVETATYILNMVPTK
SVLETPYELWKGRKGYPKETKGGLFYDPQENRVFVSTNATFLEEDHVRNHQPRSKLVLSEISKEATDKTTRVVDQASPSTRVVDGADTSGQSHPSQELRMPRRSGRVITQ
PDRYLGLAETQVIIPDDGVEDP
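
The protein sequence appears to be the CDS translated with the standard genetic code. The Python sketch below checks this on two short excerpts copied from the CDS (its first 27 bases and its final 12 bases). The `translate` helper and the table construction are illustrative assumptions, not part of the database record, and ambiguous bases such as `N` are not handled.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


print(translate("ATGACAAACTCAATAGTACAATTACTC"))  # MTNSIVQLL
print(translate("GAGGATCCATGA"))                 # EDP*
```

Both excerpts agree with the start (`MTNSIVQLL`) and end (`EDP`) of the listed protein sequence; the trailing `*` is the `TGA` stop codon.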