CuGenDBv2

CSPI03G24440 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI03G24440
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Gag/pol protein
Genome location: Chr3:21507816..21509974
RNA-Seq Expression: CSPI03G24440
Synteny: CSPI03G24440
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
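The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, the field can be parsed and the genomic span computed as below. The function names are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this record does not state explicitly.

```python
import re

# Assumed layout of the "Genome location" field: <chromosome>:<start>..<end>
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (chromosome, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return match["chrom"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr3:21507816..21509974")
print(chrom, start, end, span_length(start, end))  # Chr3 21507816 21509974 2159
```

Under that coordinate assumption the gene spans 2159 bp on Chr3.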


Homology
GenBank top hits | e value | % identity
ADJ18449.1 gag/pol protein, partial [Bryonia dioica] | 2.2e-184 | 53.26
Query:  MSDVLAKKHESLAKAKEIMDSLKEMFGQPKWSLRHEAVKYIYTKRMKEGTSVRKHVLVMMMHFKIAEVNGGPIEEVNQVSFILESLLKSFIPFQTNVSLN
        M+DVLAKKH+S+A AK IMDSL+EMFGQP WSLRHEA+K+IYTKRMKEGTSVR+HVL MMMHF IAEVNGGPI+E NQVSFIL+SL KSF+PFQTN SLN
Subjt:  MSDVLAKKHESLAKAKEIMDSLKEMFGQPKWSLRHEAVKYIYTKRMKEGTSVRKHVLVMMMHFKIAEVNGGPIEEVNQVSFILESLLKSFIPFQTNVSLN

Query:  KIEFNVTTLLNELQKFQT-----RKQVEANVATTKRSFSKGSSSKSTSGPYKPKAQMKMKGTRKTPKTNKGKRATEK-----------------------
        KIEFN+TTLLNELQ+FQ       K+VEANVA TKR F +GSSSK+  GP   KAQMK KG  K P T+K K+  +K                       
Subjt:  KIEFNVTTLLNELQKFQT-----RKQVEANVATTKRSFSKGSSSKSTSGPYKPKAQMKMKGTRKTPKTNKGKRATEK-----------------------

Query:  -------------------VEYDTSTWILDSGATNHICFSFQESCSWKKLVEHETTLKVGMREVVSVE--------------------------------
                           VE D STWILDSGATNHICFSFQE+ SWKKL E E TLKVG  EVVS E                                
Subjt:  -------------------VEYDTSTWILDSGATNHICFSFQESCSWKKLVEHETTLKVGMREVVSVE--------------------------------

Query:  ---------------------------------------------------------------------------------------------PLEDNSL
                                                                                                      LEDNSL
Subjt:  ---------------------------------------------------------------------------------------------PLEDNSL

Query:  SPCE----------SFTEKCLRAKVPLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSLEKFKEYKAKVENELGKTIKILRSDRGGE
         PCE          SFT K LRAKVPLELVHSDLCGPMNVKAR GYEYFISFID +SRYGHVYL+HHKS+S EKFKEYKA+VENE+GKTIK LRSDRGGE
Subjt:  SPCE----------SFTEKCLRAKVPLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSLEKFKEYKAKVENELGKTIKILRSDRGGE

Query:  YVDLRFQDYLIENGIQLQLSAPSTPQQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETSKHLMSFGKGVKEVYIPLEFGDARHMCWYKILRNWN--
        Y+D +FQDYLIE GIQ QLSAPSTPQQNGVSERR+RTLLDMV SMMS+AQLPDSFWGYALET+ H+++       +  P E    R     +  R W   
Subjt:  YVDLRFQDYLIENGIQLQLSAPSTPQQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETSKHLMSFGKGVKEVYIPLEFGDARHMCWYKILRNWN--

Query:  ---IIRN-----------YTYSLVTQSRGGLFCNPQDNKVFVSTNATFLEEGHIKNYQPRSKLVLSEISTNATNIPSSSTKVVDKTWKSGQSHPFQELRE
           +++N                  +SRGGLF +PQ+NKVFVSTNATFLEE H +N+QPRSK+VL E+  NAT+ PSSSTKVVDK   S QSH  QELR 
Subjt:  ---IIRN-----------YTYSLVTQSRGGLFCNPQDNKVFVSTNATFLEEGHIKNYQPRSKLVLSEISTNATNIPSSSTKVVDKTWKSGQSHPFQELRE

Query:  SQHSGRVVHQLDRYLGLTETQLVIPDDGIEDPLTYK
         + SGRVVHQ +RYLGL ETQ++IPDDG+EDPLTYK
Subjt:  SQHSGRVVHQLDRYLGLTETQLVIPDDGIEDPLTYK

KAA0040307.1 gag/pol protein [Cucumis melo var. makuwa] | 7.3e-140 | 62.69
Query:  MSDVLAKKHESLAKAKEIMDSLKEMFGQPKWSLRHEAVKYIYTKRMKEGTSVRKHVLVMMMHFKIAEVNGGPIEEVNQVSFILESLLKSFIPFQTNVSLN
        MSDVLAKKHESLA AKEIMDSLK MFGQP+W LRHEA+KYIYTKRMKEGTSVR+HVL MMMHF IAEVN G I+E NQVSFILESL KSFIPFQTN SLN
Subjt:  MSDVLAKKHESLAKAKEIMDSLKEMFGQPKWSLRHEAVKYIYTKRMKEGTSVRKHVLVMMMHFKIAEVNGGPIEEVNQVSFILESLLKSFIPFQTNVSLN

Query:  KIEFNVTTLLNELQKFQT-----RKQVEANVATTKRSFSKGSSSKSTSGPYKPKAQMKMKGTRKTPKTNKGKRATEKVE-----------YDTSTWILDS
        KIEFN+TTLLNELQ+FQ       K+VE NVATTK  F +GSSSKS SGP KP  +++ KG  KTPK NKGK+  EK +            +   ++   
Subjt:  KIEFNVTTLLNELQKFQT-----RKQVEANVATTKRSFSKGSSSKSTSGPYKPKAQMKMKGTRKTPKTNKGKRATEKVE-----------YDTSTWILDS

Query:  GATNHICFSFQESCSWKKLVEHETTLKVGMREVVSVE-------------------------------PLEDNSLSPCE----------SFTEKCLRAKV
         A       + E+ SWK+L E E TLKVG  E+VS +                                LEDNSL PC+          SFT K L+AK 
Subjt:  GATNHICFSFQESCSWKKLVEHETTLKVGMREVVSVE-------------------------------PLEDNSLSPCE----------SFTEKCLRAKV

Query:  PLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSLEKFKEYKAKVENELGKTIKILRSDRGGEYVDLRFQDYLIENGIQLQLSAPSTP
        PLEL+HS+LC PMNVKAR GYEYFI+FID YSRYGHVYL+ +KSDS EKFKEYKA+VENE GK IK LRSDRGGEY+DLRFQDYLIE+GIQ QLSAPSTP
Subjt:  PLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSLEKFKEYKAKVENELGKTIKILRSDRGGEYVDLRFQDYLIENGIQLQLSAPSTP

Query:  QQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALET-------------SKHLMSFGKGV
        Q NGVSERR+RTLLDMV SMMSFAQLPDSFWGYALET             S+ LMS+GKG+
Subjt:  QQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALET-------------SKHLMSFGKGV

KAA0054875.1 gag/pol protein [Cucumis melo var. makuwa] | 5.6e-156 | 60.22
Query:  MKEGTSVRKHVLVMMMHFKIAEVNGGPIEEVNQVSFILESLLKSFIPFQTNVSLNKIEFNVTTLLNELQKFQT-----RKQVEANVATTKRSFSKGSSSK
        M EGTSVR+HVL MMMHF IAEVNGG I+E NQVSFILESL KSFIPFQTNVSLNKIEFN+TTLLNELQ+FQ       K+VEANVATTKR FS+G+SSK
Subjt:  MKEGTSVRKHVLVMMMHFKIAEVNGGPIEEVNQVSFILESLLKSFIPFQTNVSLNKIEFNVTTLLNELQKFQT-----RKQVEANVATTKRSFSKGSSSK

Query:  STSGPYKPKAQMKMKGTRKTPKTNKGKRATEKVE----YDTSTWILD---------SGATNHICFSFQESCSWKKLVEHETTLKVGMREVVS--------
        S +GP KP  +++ KG  KTPK NK K+ TEK +     +   W+ +         +       +   E+ SWK+L E E TL+VG  E+VS        
Subjt:  STSGPYKPKAQMKMKGTRKTPKTNKGKRATEKVE----YDTSTWILD---------SGATNHICFSFQESCSWKKLVEHETTLKVGMREVVS--------

Query:  ---------VEPLEDNSLSPCE----------SFTEKCLRAKVPLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSLEKFKEYKAKV
                 +  LEDNSL PC+          SFT K L AK  LELVHSDLCGPMNVKAR GYEYFISFID Y+RYGHVYL+ +KSDS EKFKEYKA+V
Subjt:  ---------VEPLEDNSLSPCE----------SFTEKCLRAKVPLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSLEKFKEYKAKV

Query:  ENELGKTIKILRSDRGGEYVDLRFQDYLIENGIQLQLSAPSTPQQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETSKHLMSFGKGVKEVYIPLEF
        ENE GKTIK  RSDRGGEY+DLRFQDYLIE+ IQ QLSAPSTPQQNGVSERR++TLLDMV SMMSF QLPDSF GYALE + ++++          P E 
Subjt:  ENELGKTIKILRSDRGGEYVDLRFQDYLIENGIQLQLSAPSTPQQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETSKHLMSFGKGVKEVYIPLEF

Query:  GDARHMCWYKILRNWNIIRNYTYSLVTQSRGGLFCNPQDNKVFVSTNATFLEEGHIKNYQPRSKLVLSEISTNATNIPSSSTKVVDKTWKSGQSHPFQEL
                      W   ++Y      +S+GGLF +PQ+NK F+S  ATFLEE HI+N+Q RSKLV  EIS N T+ PSSSTKVVDKT K GQ+H  QEL
Subjt:  GDARHMCWYKILRNWNIIRNYTYSLVTQSRGGLFCNPQDNKVFVSTNATFLEEGHIKNYQPRSKLVLSEISTNATNIPSSSTKVVDKTWKSGQSHPFQEL

Query:  RESQHSGRVVHQLDRYLGLTETQLVIPDDGIEDPLTYK
         E +HSGRVV Q DRYLGL+E Q+VIPDDGIEDPLTYK
Subjt:  RESQHSGRVVHQLDRYLGLTETQLVIPDDGIEDPLTYK

KAA0061924.1 gag/pol protein [Cucumis melo var. makuwa] | 9.3e-135 | 53.01
Query:  MSDVLAKKHESLAKAKEIMDSLKEMFGQPKWSLRHEAVKYIYTKRMKEGTSVRKHVLVMMMHFKIAEVNGGPIEEVNQVSFILESLLKSFIPFQTNVSLN
        M+DVLAKKHESLA AKEI+D+LK MFGQ +W LRHEA+KYIYTKRMKEGT VR+HVL MMMH  I EVNGG I+E NQVSFILESL KSFIPFQTN SLN
Subjt:  MSDVLAKKHESLAKAKEIMDSLKEMFGQPKWSLRHEAVKYIYTKRMKEGTSVRKHVLVMMMHFKIAEVNGGPIEEVNQVSFILESLLKSFIPFQTNVSLN

Query:  KIEFNVTTLLNELQKFQT-----RKQVEANVATTKRSFSKGSSSKSTSGPYKPKAQMKMKGTRKTPKTNKGKRATEKVEYDTSTWILDSGATNHICFSFQ
        KIEFN+T LLNELQ+FQT      KQVEANVATTK  FS+GSSSK+ +GP KPK Q+K KG  KTPK NKGK+A EK +                C+   
Subjt:  KIEFNVTTLLNELQKFQT-----RKQVEANVATTKRSFSKGSSSKSTSGPYKPKAQMKMKGTRKTPKTNKGKRATEKVEYDTSTWILDSGATNHICFSFQ

Query:  ESCSW-----KKLVEHETTLKVGMREVVSVEPLEDNSLSPCESFTEKCLRAKVPLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSL
        ++  W     K LVE +   +   +   S             SFT K LRAK+P+ELVHS LCGPMNVKAR G                           
Subjt:  ESCSW-----KKLVEHETTLKVGMREVVSVEPLEDNSLSPCESFTEKCLRAKVPLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSL

Query:  EKFKEYKAKVENELGKTIKILRSDRGGEYVDLRFQDYLIENGIQLQLSAPSTPQQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETS---------
                                                           TP+QNGVSERR+RTLLDMV SMMSFAQLPDSFWGYALET+         
Subjt:  EKFKEYKAKVENELGKTIKILRSDRGGEYVDLRFQDYLIENGIQLQLSAPSTPQQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETS---------

Query:  ----KHLMSFGKGVKEVYIPLEFGDARHMCWYKILRNWNI---IRNYTYSLVTQSRGGLFCNPQDNKVFVSTNATFLEEGHIKNYQPRSKLVLSEISTNA
            KHLMS+GKGVK+VY+ LEF D RH CWYK L+NWNI   +    Y                 KVFVSTN +FLEE HI+++QP SKLVL+EIS +A
Subjt:  ----KHLMSFGKGVKEVYIPLEFGDARHMCWYKILRNWNI---IRNYTYSLVTQSRGGLFCNPQDNKVFVSTNATFLEEGHIKNYQPRSKLVLSEISTNA

Query:  TNIPSSSTKVVDKTWKSGQSHPFQELRESQHSGRVVHQLDRYLGLTETQLVIPDDGIEDPLTYK
        ++ PSSSTKVVDK+  S Q+HP QELRE + SGRVVHQ +RYLGL+ET +VIPDDGIED LTYK
Subjt:  TNIPSSSTKVVDKTWKSGQSHPFQELRESQHSGRVVHQLDRYLGLTETQLVIPDDGIEDPLTYK

KAA0063246.1 gag/pol protein [Cucumis melo var. makuwa] | 1.2e-139 | 64
Query:  MSDVLAKKHESLAKAKEIMDSLKEMFGQPKWSLRHEAVKYIYTKRMKEGTSVRKHVLVMMMHFKIAEVNGGPIEEVNQVSFILESLLKSFIPFQTNVSLN
        MSDVLAKKHESLA  KEIMDSLK MFGQPKWSLRHEA+KYIYTKRMKE TSVR+HVL  MMHF IAEVNGG I+E NQVSFILES  KSFIPFQTN SLN
Subjt:  MSDVLAKKHESLAKAKEIMDSLKEMFGQPKWSLRHEAVKYIYTKRMKEGTSVRKHVLVMMMHFKIAEVNGGPIEEVNQVSFILESLLKSFIPFQTNVSLN

Query:  KIEFNVTTLLNELQKFQT-----RKQVEANVATTKRSFSKGSSSKSTSGPYKPKAQMKMKGTRKTPKTNKGKRATEKVEY----DTSTWILD--------
        KIEFN+TTLLNELQ+FQ       K+VEANVA+TK  F +GSSSKS + P K   +++ KG  KTPK  KGK+ TEK +     +   W+ +        
Subjt:  KIEFNVTTLLNELQKFQT-----RKQVEANVATTKRSFSKGSSSKSTSGPYKPKAQMKMKGTRKTPKTNKGKRATEKVEY----DTSTWILD--------

Query:  ------SGATNHICFSFQESCSWKKLVEHETTLKVGMREVVS-------------------------------VEPLEDNSLSPCE----------SFTE
               GATNHICFSFQ++ SWK+L E E TLKVG  E+VS                               +  LEDNSL PC+          SFT 
Subjt:  ------SGATNHICFSFQESCSWKKLVEHETTLKVGMREVVS-------------------------------VEPLEDNSLSPCE----------SFTE

Query:  KCLRAKVPLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSLEKFKEYKAKVENELGKTIKILRSDRGGEYVDLRFQDYLIENGIQLQ
        K LRAK PLELVHSD  GPMNVKAR GY+YFISFID YSRY HVYL+ +KSDS EKFKEYKAKVENELGKTIK LRSDRG EY+DLRFQDYLIE  IQ Q
Subjt:  KCLRAKVPLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSLEKFKEYKAKVENELGKTIKILRSDRGGEYVDLRFQDYLIENGIQLQ

Query:  LSAPSTPQQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETSKHLMS
        LSAP+TPQQN VSERR+RTLLDMVHSMMSFAQLP+SFW YALET+ ++++
Subjt:  LSAPSTPQQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETSKHLMS
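In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these for one segment. The helper name is hypothetical, and the reported % identity is computed over the whole alignment, so a single segment will not reproduce it.

```python
def midline_counts(midline):
    """Return (identical, conservative, columns) for a BLAST midline segment.

    Letters are identical residues, '+' are conservative substitutions,
    and anything else (spaces) counts only toward the column total.
    """
    identical = sum(ch.isalpha() for ch in midline)
    conservative = midline.count("+")
    return identical, conservative, len(midline)


# First 13 columns of the ADJ18449.1 alignment midline shown above.
segment = "M+DVLAKKH+S+A"
identical, conservative, columns = midline_counts(segment)
print(identical, conservative, columns)  # 10 3 13
print(round(100 * identical / columns, 2))  # segment-level identity in percent
```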

TrEMBL top hits | e value | % identity
A0A5A7TFI0 Gag/pol protein | 3.5e-140 | 62.69
Query:  MSDVLAKKHESLAKAKEIMDSLKEMFGQPKWSLRHEAVKYIYTKRMKEGTSVRKHVLVMMMHFKIAEVNGGPIEEVNQVSFILESLLKSFIPFQTNVSLN
        MSDVLAKKHESLA AKEIMDSLK MFGQP+W LRHEA+KYIYTKRMKEGTSVR+HVL MMMHF IAEVN G I+E NQVSFILESL KSFIPFQTN SLN
Subjt:  MSDVLAKKHESLAKAKEIMDSLKEMFGQPKWSLRHEAVKYIYTKRMKEGTSVRKHVLVMMMHFKIAEVNGGPIEEVNQVSFILESLLKSFIPFQTNVSLN

Query:  KIEFNVTTLLNELQKFQT-----RKQVEANVATTKRSFSKGSSSKSTSGPYKPKAQMKMKGTRKTPKTNKGKRATEKVE-----------YDTSTWILDS
        KIEFN+TTLLNELQ+FQ       K+VE NVATTK  F +GSSSKS SGP KP  +++ KG  KTPK NKGK+  EK +            +   ++   
Subjt:  KIEFNVTTLLNELQKFQT-----RKQVEANVATTKRSFSKGSSSKSTSGPYKPKAQMKMKGTRKTPKTNKGKRATEKVE-----------YDTSTWILDS

Query:  GATNHICFSFQESCSWKKLVEHETTLKVGMREVVSVE-------------------------------PLEDNSLSPCE----------SFTEKCLRAKV
         A       + E+ SWK+L E E TLKVG  E+VS +                                LEDNSL PC+          SFT K L+AK 
Subjt:  GATNHICFSFQESCSWKKLVEHETTLKVGMREVVSVE-------------------------------PLEDNSLSPCE----------SFTEKCLRAKV

Query:  PLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSLEKFKEYKAKVENELGKTIKILRSDRGGEYVDLRFQDYLIENGIQLQLSAPSTP
        PLEL+HS+LC PMNVKAR GYEYFI+FID YSRYGHVYL+ +KSDS EKFKEYKA+VENE GK IK LRSDRGGEY+DLRFQDYLIE+GIQ QLSAPSTP
Subjt:  PLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSLEKFKEYKAKVENELGKTIKILRSDRGGEYVDLRFQDYLIENGIQLQLSAPSTP

Query:  QQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALET-------------SKHLMSFGKGV
        Q NGVSERR+RTLLDMV SMMSFAQLPDSFWGYALET             S+ LMS+GKG+
Subjt:  QQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALET-------------SKHLMSFGKGV

A0A5A7UMM9 Gag/pol protein | 2.7e-156 | 60.22
Query:  MKEGTSVRKHVLVMMMHFKIAEVNGGPIEEVNQVSFILESLLKSFIPFQTNVSLNKIEFNVTTLLNELQKFQT-----RKQVEANVATTKRSFSKGSSSK
        M EGTSVR+HVL MMMHF IAEVNGG I+E NQVSFILESL KSFIPFQTNVSLNKIEFN+TTLLNELQ+FQ       K+VEANVATTKR FS+G+SSK
Subjt:  MKEGTSVRKHVLVMMMHFKIAEVNGGPIEEVNQVSFILESLLKSFIPFQTNVSLNKIEFNVTTLLNELQKFQT-----RKQVEANVATTKRSFSKGSSSK

Query:  STSGPYKPKAQMKMKGTRKTPKTNKGKRATEKVE----YDTSTWILD---------SGATNHICFSFQESCSWKKLVEHETTLKVGMREVVS--------
        S +GP KP  +++ KG  KTPK NK K+ TEK +     +   W+ +         +       +   E+ SWK+L E E TL+VG  E+VS        
Subjt:  STSGPYKPKAQMKMKGTRKTPKTNKGKRATEKVE----YDTSTWILD---------SGATNHICFSFQESCSWKKLVEHETTLKVGMREVVS--------

Query:  ---------VEPLEDNSLSPCE----------SFTEKCLRAKVPLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSLEKFKEYKAKV
                 +  LEDNSL PC+          SFT K L AK  LELVHSDLCGPMNVKAR GYEYFISFID Y+RYGHVYL+ +KSDS EKFKEYKA+V
Subjt:  ---------VEPLEDNSLSPCE----------SFTEKCLRAKVPLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSLEKFKEYKAKV

Query:  ENELGKTIKILRSDRGGEYVDLRFQDYLIENGIQLQLSAPSTPQQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETSKHLMSFGKGVKEVYIPLEF
        ENE GKTIK  RSDRGGEY+DLRFQDYLIE+ IQ QLSAPSTPQQNGVSERR++TLLDMV SMMSF QLPDSF GYALE + ++++          P E 
Subjt:  ENELGKTIKILRSDRGGEYVDLRFQDYLIENGIQLQLSAPSTPQQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETSKHLMSFGKGVKEVYIPLEF

Query:  GDARHMCWYKILRNWNIIRNYTYSLVTQSRGGLFCNPQDNKVFVSTNATFLEEGHIKNYQPRSKLVLSEISTNATNIPSSSTKVVDKTWKSGQSHPFQEL
                      W   ++Y      +S+GGLF +PQ+NK F+S  ATFLEE HI+N+Q RSKLV  EIS N T+ PSSSTKVVDKT K GQ+H  QEL
Subjt:  GDARHMCWYKILRNWNIIRNYTYSLVTQSRGGLFCNPQDNKVFVSTNATFLEEGHIKNYQPRSKLVLSEISTNATNIPSSSTKVVDKTWKSGQSHPFQEL

Query:  RESQHSGRVVHQLDRYLGLTETQLVIPDDGIEDPLTYK
         E +HSGRVV Q DRYLGL+E Q+VIPDDGIEDPLTYK
Subjt:  RESQHSGRVVHQLDRYLGLTETQLVIPDDGIEDPLTYK

A0A5A7V9X9 Gag/pol protein | 6.0e-140 | 64
Query:  MSDVLAKKHESLAKAKEIMDSLKEMFGQPKWSLRHEAVKYIYTKRMKEGTSVRKHVLVMMMHFKIAEVNGGPIEEVNQVSFILESLLKSFIPFQTNVSLN
        MSDVLAKKHESLA  KEIMDSLK MFGQPKWSLRHEA+KYIYTKRMKE TSVR+HVL  MMHF IAEVNGG I+E NQVSFILES  KSFIPFQTN SLN
Subjt:  MSDVLAKKHESLAKAKEIMDSLKEMFGQPKWSLRHEAVKYIYTKRMKEGTSVRKHVLVMMMHFKIAEVNGGPIEEVNQVSFILESLLKSFIPFQTNVSLN

Query:  KIEFNVTTLLNELQKFQT-----RKQVEANVATTKRSFSKGSSSKSTSGPYKPKAQMKMKGTRKTPKTNKGKRATEKVEY----DTSTWILD--------
        KIEFN+TTLLNELQ+FQ       K+VEANVA+TK  F +GSSSKS + P K   +++ KG  KTPK  KGK+ TEK +     +   W+ +        
Subjt:  KIEFNVTTLLNELQKFQT-----RKQVEANVATTKRSFSKGSSSKSTSGPYKPKAQMKMKGTRKTPKTNKGKRATEKVEY----DTSTWILD--------

Query:  ------SGATNHICFSFQESCSWKKLVEHETTLKVGMREVVS-------------------------------VEPLEDNSLSPCE----------SFTE
               GATNHICFSFQ++ SWK+L E E TLKVG  E+VS                               +  LEDNSL PC+          SFT 
Subjt:  ------SGATNHICFSFQESCSWKKLVEHETTLKVGMREVVS-------------------------------VEPLEDNSLSPCE----------SFTE

Query:  KCLRAKVPLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSLEKFKEYKAKVENELGKTIKILRSDRGGEYVDLRFQDYLIENGIQLQ
        K LRAK PLELVHSD  GPMNVKAR GY+YFISFID YSRY HVYL+ +KSDS EKFKEYKAKVENELGKTIK LRSDRG EY+DLRFQDYLIE  IQ Q
Subjt:  KCLRAKVPLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSLEKFKEYKAKVENELGKTIKILRSDRGGEYVDLRFQDYLIENGIQLQ

Query:  LSAPSTPQQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETSKHLMS
        LSAP+TPQQN VSERR+RTLLDMVHSMMSFAQLP+SFW YALET+ ++++
Subjt:  LSAPSTPQQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETSKHLMS

A0A5D3DK93 Gag/pol protein | 4.5e-135 | 53.01
Query:  MSDVLAKKHESLAKAKEIMDSLKEMFGQPKWSLRHEAVKYIYTKRMKEGTSVRKHVLVMMMHFKIAEVNGGPIEEVNQVSFILESLLKSFIPFQTNVSLN
        M+DVLAKKHESLA AKEI+D+LK MFGQ +W LRHEA+KYIYTKRMKEGT VR+HVL MMMH  I EVNGG I+E NQVSFILESL KSFIPFQTN SLN
Subjt:  MSDVLAKKHESLAKAKEIMDSLKEMFGQPKWSLRHEAVKYIYTKRMKEGTSVRKHVLVMMMHFKIAEVNGGPIEEVNQVSFILESLLKSFIPFQTNVSLN

Query:  KIEFNVTTLLNELQKFQT-----RKQVEANVATTKRSFSKGSSSKSTSGPYKPKAQMKMKGTRKTPKTNKGKRATEKVEYDTSTWILDSGATNHICFSFQ
        KIEFN+T LLNELQ+FQT      KQVEANVATTK  FS+GSSSK+ +GP KPK Q+K KG  KTPK NKGK+A EK +                C+   
Subjt:  KIEFNVTTLLNELQKFQT-----RKQVEANVATTKRSFSKGSSSKSTSGPYKPKAQMKMKGTRKTPKTNKGKRATEKVEYDTSTWILDSGATNHICFSFQ

Query:  ESCSW-----KKLVEHETTLKVGMREVVSVEPLEDNSLSPCESFTEKCLRAKVPLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSL
        ++  W     K LVE +   +   +   S             SFT K LRAK+P+ELVHS LCGPMNVKAR G                           
Subjt:  ESCSW-----KKLVEHETTLKVGMREVVSVEPLEDNSLSPCESFTEKCLRAKVPLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSL

Query:  EKFKEYKAKVENELGKTIKILRSDRGGEYVDLRFQDYLIENGIQLQLSAPSTPQQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETS---------
                                                           TP+QNGVSERR+RTLLDMV SMMSFAQLPDSFWGYALET+         
Subjt:  EKFKEYKAKVENELGKTIKILRSDRGGEYVDLRFQDYLIENGIQLQLSAPSTPQQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETS---------

Query:  ----KHLMSFGKGVKEVYIPLEFGDARHMCWYKILRNWNI---IRNYTYSLVTQSRGGLFCNPQDNKVFVSTNATFLEEGHIKNYQPRSKLVLSEISTNA
            KHLMS+GKGVK+VY+ LEF D RH CWYK L+NWNI   +    Y                 KVFVSTN +FLEE HI+++QP SKLVL+EIS +A
Subjt:  ----KHLMSFGKGVKEVYIPLEFGDARHMCWYKILRNWNI---IRNYTYSLVTQSRGGLFCNPQDNKVFVSTNATFLEEGHIKNYQPRSKLVLSEISTNA

Query:  TNIPSSSTKVVDKTWKSGQSHPFQELRESQHSGRVVHQLDRYLGLTETQLVIPDDGIEDPLTYK
        ++ PSSSTKVVDK+  S Q+HP QELRE + SGRVVHQ +RYLGL+ET +VIPDDGIED LTYK
Subjt:  TNIPSSSTKVVDKTWKSGQSHPFQELRESQHSGRVVHQLDRYLGLTETQLVIPDDGIEDPLTYK

E2GK51 Gag/pol protein (Fragment) | 1.1e-184 | 53.26
Query:  MSDVLAKKHESLAKAKEIMDSLKEMFGQPKWSLRHEAVKYIYTKRMKEGTSVRKHVLVMMMHFKIAEVNGGPIEEVNQVSFILESLLKSFIPFQTNVSLN
        M+DVLAKKH+S+A AK IMDSL+EMFGQP WSLRHEA+K+IYTKRMKEGTSVR+HVL MMMHF IAEVNGGPI+E NQVSFIL+SL KSF+PFQTN SLN
Subjt:  MSDVLAKKHESLAKAKEIMDSLKEMFGQPKWSLRHEAVKYIYTKRMKEGTSVRKHVLVMMMHFKIAEVNGGPIEEVNQVSFILESLLKSFIPFQTNVSLN

Query:  KIEFNVTTLLNELQKFQT-----RKQVEANVATTKRSFSKGSSSKSTSGPYKPKAQMKMKGTRKTPKTNKGKRATEK-----------------------
        KIEFN+TTLLNELQ+FQ       K+VEANVA TKR F +GSSSK+  GP   KAQMK KG  K P T+K K+  +K                       
Subjt:  KIEFNVTTLLNELQKFQT-----RKQVEANVATTKRSFSKGSSSKSTSGPYKPKAQMKMKGTRKTPKTNKGKRATEK-----------------------

Query:  -------------------VEYDTSTWILDSGATNHICFSFQESCSWKKLVEHETTLKVGMREVVSVE--------------------------------
                           VE D STWILDSGATNHICFSFQE+ SWKKL E E TLKVG  EVVS E                                
Subjt:  -------------------VEYDTSTWILDSGATNHICFSFQESCSWKKLVEHETTLKVGMREVVSVE--------------------------------

Query:  ---------------------------------------------------------------------------------------------PLEDNSL
                                                                                                      LEDNSL
Subjt:  ---------------------------------------------------------------------------------------------PLEDNSL

Query:  SPCE----------SFTEKCLRAKVPLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSLEKFKEYKAKVENELGKTIKILRSDRGGE
         PCE          SFT K LRAKVPLELVHSDLCGPMNVKAR GYEYFISFID +SRYGHVYL+HHKS+S EKFKEYKA+VENE+GKTIK LRSDRGGE
Subjt:  SPCE----------SFTEKCLRAKVPLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSLEKFKEYKAKVENELGKTIKILRSDRGGE

Query:  YVDLRFQDYLIENGIQLQLSAPSTPQQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETSKHLMSFGKGVKEVYIPLEFGDARHMCWYKILRNWN--
        Y+D +FQDYLIE GIQ QLSAPSTPQQNGVSERR+RTLLDMV SMMS+AQLPDSFWGYALET+ H+++       +  P E    R     +  R W   
Subjt:  YVDLRFQDYLIENGIQLQLSAPSTPQQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETSKHLMSFGKGVKEVYIPLEFGDARHMCWYKILRNWN--

Query:  ---IIRN-----------YTYSLVTQSRGGLFCNPQDNKVFVSTNATFLEEGHIKNYQPRSKLVLSEISTNATNIPSSSTKVVDKTWKSGQSHPFQELRE
           +++N                  +SRGGLF +PQ+NKVFVSTNATFLEE H +N+QPRSK+VL E+  NAT+ PSSSTKVVDK   S QSH  QELR 
Subjt:  ---IIRN-----------YTYSLVTQSRGGLFCNPQDNKVFVSTNATFLEEGHIKNYQPRSKLVLSEISTNATNIPSSSTKVVDKTWKSGQSHPFQELRE

Query:  SQHSGRVVHQLDRYLGLTETQLVIPDDGIEDPLTYK
         + SGRVVHQ +RYLGL ETQ++IPDDG+EDPLTYK
Subjt:  SQHSGRVVHQLDRYLGLTETQLVIPDDGIEDPLTYK

SwissProt top hits | e value | % identity
P04146 Copia protein | 4.5e-23 | 33.16
Query:  LKVGMREVVSVEPLEDNSLSPCESFTEKCLRAKV----------------PLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSLEKF
        L++  + + S + L +N    CE   E CL  K                 PL +VHSD+CGP+         YF+ F+D ++ Y   YL+ +KSD    F
Subjt:  LKVGMREVVSVEPLEDNSLSPCESFTEKCLRAKV----------------PLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSLEKF

Query:  KEYKAKVENELGKTIKILRSDRGGEYVDLRFQDYLIENGIQLQLSAPSTPQQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETSKHLMS
        +++ AK E      +  L  D G EY+    + + ++ GI   L+ P TPQ NGVSER  RT+ +   +M+S A+L  SFWG A+ T+ +L++
Subjt:  KEYKAKVENELGKTIKILRSDRGGEYVDLRFQDYLIENGIQLQLSAPSTPQQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETSKHLMS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 5.8e-31 | 42.31
Query:  LELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSLEKFKEYKAKVENELGKTIKILRSDRGGEYVDLRFQDYLIENGIQLQLSAPSTPQ
        L+LV+SD+CGPM +++  G +YF++FID  SR   VY++  K    + F+++ A VE E G+ +K LRSD GGEY    F++Y   +GI+ + + P TPQ
Subjt:  LELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSLEKFKEYKAKVENELGKTIKILRSDRGGEYVDLRFQDYLIENGIQLQLSAPSTPQ

Query:  QNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETSKHLMSFGKGVKEVYIPLEF
         NGV+ER +RT+++ V SM+  A+LP SFWG A++T+ +L++     +   +PL F
Subjt:  QNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETSKHLMSFGKGVKEVYIPLEF

Q12490 Transposon Ty1-BL Gag-Pol polyprotein | 8.4e-14 | 30
Query:  PLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDS--LEKFKEYKAKVENELGKTIKILRSDRGGEYVDLRFQDYLIENGIQLQLSAPS
        P + +H+D+ GP++   +    YFISF D  +++  VY +H + +   L+ F    A ++N+   ++ +++ DRG EY +     +L +NGI    +  +
Subjt:  PLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDS--LEKFKEYKAKVENELGKTIKILRSDRGGEYVDLRFQDYLIENGIQLQLSAPS

Query:  TPQQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETS
          + +GV+ER +RTLLD   + +  + LP+  W  A+E S
Subjt:  TPQQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 2.1e-17 | 31.37
Query:  FTEKCLRAKVPLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSLEKFKEYKAKVENELGKTIKILRSDRGGEYVDLRFQDYLIENGI
        F++  + +  PLE ++SD+     + +   Y Y++ F+D ++RY  +Y +  KS   E F  +K  +EN     I    SD GGE+V L   +Y  ++GI
Subjt:  FTEKCLRAKVPLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSLEKFKEYKAKVENELGKTIKILRSDRGGEYVDLRFQDYLIENGI

Query:  QLQLSAPSTPQQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETSKHLMS
            S P TP+ NG+SER+ R +++   +++S A +P ++W YA   + +L++
Subjt:  QLQLSAPSTPQQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETSKHLMS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 1.0e-19 | 34.64
Query:  FTEKCLRAKVPLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSLEKFKEYKAKVENELGKTIKILRSDRGGEYVDLRFQDYLIENGI
        F+   + +  PLE ++SD+     + +   Y Y++ F+D ++RY  +Y +  KS   + F  +K+ VEN     I  L SD GGE+V LR  DYL ++GI
Subjt:  FTEKCLRAKVPLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSLEKFKEYKAKVENELGKTIKILRSDRGGEYVDLRFQDYLIENGI

Query:  QLQLSAPSTPQQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETSKHLMS
            S P TP+ NG+SER+ R +++M  +++S A +P ++W YA   + +L++
Subjt:  QLQLSAPSTPQQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETSKHLMS
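The SwissProt hits above can be ranked by e value, where a smaller value indicates a more significant match. This is a minimal sketch using the five accessions and values listed in this section. The tuple layout and variable names are illustrative assumptions, not part of the database.

```python
# (accession, e value, % identity) as listed under "SwissProt top hits".
swissprot_hits = [
    ("P04146", 4.5e-23, 33.16),
    ("P10978", 5.8e-31, 42.31),
    ("Q12490", 8.4e-14, 30.0),
    ("Q94HW2", 2.1e-17, 31.37),
    ("Q9ZT94", 1.0e-19, 34.64),
]

# Sort ascending by e value so the most significant hit comes first.
ranked = sorted(swissprot_hits, key=lambda hit: hit[1])

for accession, e_value, identity in ranked:
    print(f"{accession}\t{e_value:.1e}\t{identity:.2f}")

best_accession = ranked[0][0]
print(best_accession)  # P10978
```

With these values the most significant SwissProt hit is P10978, which also has the highest % identity of the five.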

Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGTCTGATGTTTTGGCAAAGAAACATGAATCCTTAGCTAAGGCTAAGGAGATTATGGATTCATTAAAGGAAATGTTTGGGCAACCAAAATGGTCCTTAAGACACGAGGC
AGTCAAATACATTTACACTAAGCGTATGAAGGAAGGGACCTCTGTTAGAAAACATGTTCTGGTCATGATGATGCACTTCAAGATCGCCGAAGTAAATGGTGGTCCCATCG
AAGAGGTTAATCAAGTTAGCTTTATTTTAGAGTCTCTTCTAAAGAGCTTCATTCCATTTCAGACAAATGTGTCTTTAAACAAGATAGAGTTTAACGTGACAACCCTTCTG
AATGAGCTTCAAAAGTTCCAAACTCGAAAACAAGTGGAAGCAAATGTTGCTACCACAAAAAGAAGCTTTTCAAAAGGATCGTCCTCTAAAAGCACAAGTGGACCTTATAA
GCCTAAAGCTCAAATGAAGATGAAGGGAACGAGGAAGACTCCCAAGACGAACAAGGGAAAGAGAGCCACAGAAAAAGTCGAATATGATACTTCAACCTGGATACTAGATT
CAGGAGCCACTAATCATATTTGCTTCTCATTTCAGGAATCTTGTTCTTGGAAGAAGCTTGTAGAACATGAGACTACTCTCAAGGTTGGAATGAGAGAGGTAGTCTCAGTT
GAACCATTAGAAGATAACTCTCTATCTCCATGTGAATCTTTTACTGAAAAATGTCTTAGAGCCAAAGTACCTTTAGAGCTTGTGCATTCGGACCTTTGTGGACCAATGAA
TGTCAAGGCTCGAGTAGGGTACGAATATTTCATCAGTTTCATTGATTGTTATTCAAGGTATGGTCATGTTTACCTAATGCATCATAAGTCTGATTCTCTTGAAAAGTTCA
AAGAATACAAGGCTAAAGTTGAGAACGAATTAGGAAAAACAATAAAGATACTTCGATCAGATCGAGGTGGAGAGTATGTGGACTTGAGATTCCAAGATTATTTGATAGAA
AATGGAATTCAGTTACAACTCTCTGCACCTAGTACGCCTCAACAGAACGGTGTATCAGAAAGAAGAAGCCGAACCTTGTTAGACATGGTTCACTCTATGATGAGCTTTGC
TCAATTGCCAGATTCTTTTTGGGGATATGCTTTAGAGACATCTAAACACCTTATGAGCTTTGGAAAGGGCGTAAAGGAAGTTTACATCCCTTTAGAATTTGGGGATGCCC
GACACATGTGTTGGTACAAAATCCTAAGAAATTGGAACATCATTCGAAATTATACCTATTCGTTGGTTACCCAATCAAGGGGTGGTCTCTTTTGCAATCCTCAAGATAAT
AAAGTATTTGTTTCGACAAATGCTACATTCTTGGAGGAAGGCCACATCAAGAATTATCAACCGCGCAGTAAACTAGTATTAAGTGAGATTTCTACAAATGCTACAAATAT
ACCTAGTTCATCAACTAAAGTAGTTGATAAAACTTGGAAATCTGGTCAATCACATCCTTTTCAAGAGTTGAGAGAGTCTCAACATAGTGGGAGGGTTGTTCATCAGCTTG
ATCGCTATTTGGGTTTAACAGAAACTCAACTCGTCATACCTGATGACGGCATAGAGGATCCATTAACCTATAAATAG
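The CDS above can be checked against the protein sequence listed further down by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state, and uses a hypothetical `translate` helper. Only the first 30 and last 18 nucleotides of the CDS are used as inputs here.

```python
from itertools import product

# Standard genetic code (assumed), with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTCTGATGTTTTGGCAAAGAAACATGAA"))  # MSDVLAKKHE
print(translate("CCATTAACCTATAAATAG"))  # PLTYK
```

Both results agree with the start (`MSDVLAKKHE`) and end (`PLTYK`) of the protein sequence in this record.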
mRNA sequence
ATGTCTGATGTTTTGGCAAAGAAACATGAATCCTTAGCTAAGGCTAAGGAGATTATGGATTCATTAAAGGAAATGTTTGGGCAACCAAAATGGTCCTTAAGACACGAGGC
AGTCAAATACATTTACACTAAGCGTATGAAGGAAGGGACCTCTGTTAGAAAACATGTTCTGGTCATGATGATGCACTTCAAGATCGCCGAAGTAAATGGTGGTCCCATCG
AAGAGGTTAATCAAGTTAGCTTTATTTTAGAGTCTCTTCTAAAGAGCTTCATTCCATTTCAGACAAATGTGTCTTTAAACAAGATAGAGTTTAACGTGACAACCCTTCTG
AATGAGCTTCAAAAGTTCCAAACTCGAAAACAAGTGGAAGCAAATGTTGCTACCACAAAAAGAAGCTTTTCAAAAGGATCGTCCTCTAAAAGCACAAGTGGACCTTATAA
GCCTAAAGCTCAAATGAAGATGAAGGGAACGAGGAAGACTCCCAAGACGAACAAGGGAAAGAGAGCCACAGAAAAAGTCGAATATGATACTTCAACCTGGATACTAGATT
CAGGAGCCACTAATCATATTTGCTTCTCATTTCAGGAATCTTGTTCTTGGAAGAAGCTTGTAGAACATGAGACTACTCTCAAGGTTGGAATGAGAGAGGTAGTCTCAGTT
GAACCATTAGAAGATAACTCTCTATCTCCATGTGAATCTTTTACTGAAAAATGTCTTAGAGCCAAAGTACCTTTAGAGCTTGTGCATTCGGACCTTTGTGGACCAATGAA
TGTCAAGGCTCGAGTAGGGTACGAATATTTCATCAGTTTCATTGATTGTTATTCAAGGTATGGTCATGTTTACCTAATGCATCATAAGTCTGATTCTCTTGAAAAGTTCA
AAGAATACAAGGCTAAAGTTGAGAACGAATTAGGAAAAACAATAAAGATACTTCGATCAGATCGAGGTGGAGAGTATGTGGACTTGAGATTCCAAGATTATTTGATAGAA
AATGGAATTCAGTTACAACTCTCTGCACCTAGTACGCCTCAACAGAACGGTGTATCAGAAAGAAGAAGCCGAACCTTGTTAGACATGGTTCACTCTATGATGAGCTTTGC
TCAATTGCCAGATTCTTTTTGGGGATATGCTTTAGAGACATCTAAACACCTTATGAGCTTTGGAAAGGGCGTAAAGGAAGTTTACATCCCTTTAGAATTTGGGGATGCCC
GACACATGTGTTGGTACAAAATCCTAAGAAATTGGAACATCATTCGAAATTATACCTATTCGTTGGTTACCCAATCAAGGGGTGGTCTCTTTTGCAATCCTCAAGATAAT
AAAGTATTTGTTTCGACAAATGCTACATTCTTGGAGGAAGGCCACATCAAGAATTATCAACCGCGCAGTAAACTAGTATTAAGTGAGATTTCTACAAATGCTACAAATAT
ACCTAGTTCATCAACTAAAGTAGTTGATAAAACTTGGAAATCTGGTCAATCACATCCTTTTCAAGAGTTGAGAGAGTCTCAACATAGTGGGAGGGTTGTTCATCAGCTTG
ATCGCTATTTGGGTTTAACAGAAACTCAACTCGTCATACCTGATGACGGCATAGAGGATCCATTAACCTATAAATAG
Protein sequence
MSDVLAKKHESLAKAKEIMDSLKEMFGQPKWSLRHEAVKYIYTKRMKEGTSVRKHVLVMMMHFKIAEVNGGPIEEVNQVSFILESLLKSFIPFQTNVSLNKIEFNVTTLL
NELQKFQTRKQVEANVATTKRSFSKGSSSKSTSGPYKPKAQMKMKGTRKTPKTNKGKRATEKVEYDTSTWILDSGATNHICFSFQESCSWKKLVEHETTLKVGMREVVSV
EPLEDNSLSPCESFTEKCLRAKVPLELVHSDLCGPMNVKARVGYEYFISFIDCYSRYGHVYLMHHKSDSLEKFKEYKAKVENELGKTIKILRSDRGGEYVDLRFQDYLIE
NGIQLQLSAPSTPQQNGVSERRSRTLLDMVHSMMSFAQLPDSFWGYALETSKHLMSFGKGVKEVYIPLEFGDARHMCWYKILRNWNIIRNYTYSLVTQSRGGLFCNPQDN
KVFVSTNATFLEEGHIKNYQPRSKLVLSEISTNATNIPSSSTKVVDKTWKSGQSHPFQELRESQHSGRVVHQLDRYLGLTETQLVIPDDGIEDPLTYK