; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016484 (gene) of Snake gourd v1 genome

Gene IDTan0016484
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG10:50248720..50250381
RNA-Seq ExpressionTan0016484
SyntenyTan0016484
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-21570.11Show/hide
Query:  MPRRSGRAVRQPDRYMGLAETSVVAPDDDCEDPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELVDQLDRVKPIGCKWIYKRKRCVDG---------
        MPRRSGR V QP+RY+GL ET VV PDD  EDPL+Y QAM DVDKD+ +KAMD EMESMYFNSVWELVD  + VKPIGCKWIYKRKR   G         
Subjt:  MPRRSGRAVRQPDRYMGLAETSVVAPDDDCEDPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELVDQLDRVKPIGCKWIYKRKRCVDG---------

Query:  -----------------------KSIRILLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAI
                               KSIRILL+I  +YDY++WQMDVKTAFLNG L+E+I+M QP+GFI QGQEQKVC+L+RSIYG K+ASRSWNIRFD AI
Subjt:  -----------------------KSIRILLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAI

Query:  KSYGFDQNVDEPCVYKKIVDKTIAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSK
        KSYGFDQNVDEPCVYKKI    +AFLVLYVDDILLIGN+V +LTDVK WLA+QFQMKDLGE  Y+LGIQI+R+RKN+TLAL QA+YIDK+L RY MQNSK
Subjt:  KSYGFDQNVDEPCVYKKIVDKTIAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSK

Query:  KSLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKILGNL-----------LVVT---
        K LLPFRHGVHLSK+Q PKTPQ+VEDMRRIPYASAVGSLMY MLCTR DICYAVGIVSRYQSNPGLDHWT VK +LK L              L++T   
Subjt:  KSLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKILGNL-----------LVVT---

Query:  ------------------FIRMEESVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHI
                          F     +VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKF+ DLEV+PNMNLPITL+CDNSGA  NS+EPR+HKRGKHI
Subjt:  ------------------FIRMEESVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHI

Query:  ERKYHLIREIMHRRDVTVTQIASEHNVADPFTKPLTAKVFKGHLESLGLREL
        ERKYHLIREI+ R DV VT+IASEHN+ADPFTK LTAKVF+GHLESLGLR++
Subjt:  ERKYHLIREIMHRRDVTVTQIASEHNVADPFTKPLTAKVFKGHLESLGLREL

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]6.3e-21570.11Show/hide
Query:  MPRRSGRAVRQPDRYMGLAETSVVAPDDDCEDPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELVDQLDRVKPIGCKWIYKRKRCVDG---------
        MPRRSGR V QP+RY+GL ET VV PDD  EDPL+Y QAM DVDKD+ +KAMD EMESMYFNSVWELVD  + VKPIGCKWIYKRKR   G         
Subjt:  MPRRSGRAVRQPDRYMGLAETSVVAPDDDCEDPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELVDQLDRVKPIGCKWIYKRKRCVDG---------

Query:  -----------------------KSIRILLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAI
                               KSIRILL+I  +YDY++WQMDVKTAFLNG L+E+I+M QP+GFI QGQEQKVC+L+RSIYG K+ASRSWNIRFD AI
Subjt:  -----------------------KSIRILLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAI

Query:  KSYGFDQNVDEPCVYKKIVDKTIAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSK
        KSYGFDQNVDEPCVYKKI    +AFLVLYVDDILLIGN+V +LTDVK WLA+QFQMKDLGE  Y+LGIQI+R+RKN+TLAL QA+YIDK+L RY MQNSK
Subjt:  KSYGFDQNVDEPCVYKKIVDKTIAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSK

Query:  KSLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKILGNL-----------LVVT---
        K LLPFRHGVHLSK+Q PKTPQ+VEDMRRIPYASAVGSLMY MLCTR DICYAVGIVSRYQSNPGLDHWT VK ILK L              L++T   
Subjt:  KSLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKILGNL-----------LVVT---

Query:  ------------------FIRMEESVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHI
                          F     +VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWL+KF+ DLEV+PNMNLPITL+CDNSGA  NS+EPR+HKRGKHI
Subjt:  ------------------FIRMEESVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHI

Query:  ERKYHLIREIMHRRDVTVTQIASEHNVADPFTKPLTAKVFKGHLESLGLREL
        ERKYHLIREI+ R DV VT+IASEHN+ADPFTK LTAKVF+GHLESLGLR++
Subjt:  ERKYHLIREIMHRRDVTVTQIASEHNVADPFTKPLTAKVFKGHLESLGLREL

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]7.2e-21168.48Show/hide
Query:  PRRSGRAVRQPDRYMGLAETSVVAPDDDCEDPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELVDQLDRVKPIGCKWIYKRKRCVDG----------
        PRRSGR    P RYM L ET  V  D D EDPLT+ +AM DVDKDE IKAM+ E+ESMYFNSVW+LVDQ D VKPIGCKWIYKRKR  DG          
Subjt:  PRRSGRAVRQPDRYMGLAETSVVAPDDDCEDPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELVDQLDRVKPIGCKWIYKRKRCVDG----------

Query:  ----------------------KSIRILLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAIK
                              KSIRILL+I AY+DY++WQMDVKTAFLNG L+ETIYM QP+GFI  GQEQK+C+L+RSIYG K+ASRSWNIRFD AIK
Subjt:  ----------------------KSIRILLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAIK

Query:  SYGFDQNVDEPCVYKKIVDKTIAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSKK
        SYGFDQ VDEPCVYK+I++K++AFLVLYVDDILLIGN++  LTD+K+WLA+QFQMKDLGE  ++LGIQI R+RKN+ LAL QASYIDK++ +Y MQNSK+
Subjt:  SYGFDQNVDEPCVYKKIVDKTIAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSKK

Query:  SLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKILGNL-----------LVVT----
         LLPFRHGV LSK+QCPKTPQDVE+MR IPYASAVGSLMY MLCTR DICYAVGIVSRYQSNPGL HWT VK ILK L              L++T    
Subjt:  SLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKILGNL-----------LVVT----

Query:  -----------------FIRMEESVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHIE
                         F     +VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLR F+ DLEV+PNM+ PITL+CDNSGA  NSREPR+HKRGKHIE
Subjt:  -----------------FIRMEESVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHIE

Query:  RKYHLIREIMHRRDVTVTQIASEHNVADPFTKPLTAKVFKGHLESLGLRELP
        RKYHLIREI+HR DV VTQIAS HNVADPFTKPLTAKVF+GHLESLGLR++P
Subjt:  RKYHLIREIMHRRDVTVTQIASEHNVADPFTKPLTAKVFKGHLESLGLRELP

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-21570.11Show/hide
Query:  MPRRSGRAVRQPDRYMGLAETSVVAPDDDCEDPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELVDQLDRVKPIGCKWIYKRKRCVDG---------
        MPRRSGR V QP+RY+GL ET VV PDD  EDPL+Y QAM DVDKD+ +KAMD EMESMYFNSVWELVD  + VKPIGCKWIYKRKR   G         
Subjt:  MPRRSGRAVRQPDRYMGLAETSVVAPDDDCEDPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELVDQLDRVKPIGCKWIYKRKRCVDG---------

Query:  -----------------------KSIRILLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAI
                               KSIRILL+I  +YDY++WQMDVKTAFLNG L+E+I+M QP+GFI QGQEQKVC+L+RSIYG K+ASRSWNIRFD AI
Subjt:  -----------------------KSIRILLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAI

Query:  KSYGFDQNVDEPCVYKKIVDKTIAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSK
        KSYGFDQNVDEPCVYKKI    +AFLVLYVDDILLIGN+V +LTDVK WLA+QFQMKDLGE  Y+LGIQI+R+RKN+TLAL QA+YIDK+L RY MQNSK
Subjt:  KSYGFDQNVDEPCVYKKIVDKTIAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSK

Query:  KSLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKILGNL-----------LVVT---
        K LLPFRHGVHLSK+Q PKTPQ+VEDMRRIPYASAVGSLMY MLCTR DICYAVGIVSRYQSNPGLDHWT VK +LK L              L++T   
Subjt:  KSLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKILGNL-----------LVVT---

Query:  ------------------FIRMEESVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHI
                          F     +VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKF+ DLEV+PNMNLPITL+CDNSGA  NS+EPR+HKRGKHI
Subjt:  ------------------FIRMEESVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHI

Query:  ERKYHLIREIMHRRDVTVTQIASEHNVADPFTKPLTAKVFKGHLESLGLREL
        ERKYHLIREI+ R DV VT+IASEHN+ADPFTK LTAKVF+GHLESLGLR++
Subjt:  ERKYHLIREIMHRRDVTVTQIASEHNVADPFTKPLTAKVFKGHLESLGLREL

TYJ97618.1 gag/pol protein [Cucumis melo var. makuwa]7.2e-21168.48Show/hide
Query:  PRRSGRAVRQPDRYMGLAETSVVAPDDDCEDPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELVDQLDRVKPIGCKWIYKRKRCVDG----------
        PRRSGR    P RYM L ET  V  D D EDPLT+ +AM DVDKDE IKAM+ E+ESMYFNSVW+LVDQ D VKPIGCKWIYKRKR  DG          
Subjt:  PRRSGRAVRQPDRYMGLAETSVVAPDDDCEDPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELVDQLDRVKPIGCKWIYKRKRCVDG----------

Query:  ----------------------KSIRILLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAIK
                              KSIRILL+I AY+DY++WQMDVKTAFLNG L+ETIYM QP+GFI  GQEQK+C+L+RSIYG K+ASRSWNIRFD AIK
Subjt:  ----------------------KSIRILLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAIK

Query:  SYGFDQNVDEPCVYKKIVDKTIAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSKK
        SYGFDQ VDEPCVYK+I++K++AFLVLYVDDILLIGN++  LTD+K+WLA+QFQMKDLGE  ++LGIQI R+RKN+ LAL QASYIDK++ +Y MQNSK+
Subjt:  SYGFDQNVDEPCVYKKIVDKTIAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSKK

Query:  SLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKILGNL-----------LVVT----
         LLPFRHGV LSK+QCPKTPQDVE+MR IPYASAVGSLMY MLCTR DICYAVGIVSRYQSNPGL HWT VK ILK L              L++T    
Subjt:  SLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKILGNL-----------LVVT----

Query:  -----------------FIRMEESVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHIE
                         F     +VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLR F+ DLEV+PNM+ PITL+CDNSGA  NSREPR+HKRGKHIE
Subjt:  -----------------FIRMEESVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHIE

Query:  RKYHLIREIMHRRDVTVTQIASEHNVADPFTKPLTAKVFKGHLESLGLRELP
        RKYHLIREI+HR DV VTQIAS HNVADPFTKPLTAKVF+GHLESLGLR++P
Subjt:  RKYHLIREIMHRRDVTVTQIASEHNVADPFTKPLTAKVFKGHLESLGLRELP

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein3.5e-21168.48Show/hide
Query:  PRRSGRAVRQPDRYMGLAETSVVAPDDDCEDPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELVDQLDRVKPIGCKWIYKRKRCVDG----------
        PRRSGR    P RYM L ET  V  D D EDPLT+ +AM DVDKDE IKAM+ E+ESMYFNSVW+LVDQ D VKPIGCKWIYKRKR  DG          
Subjt:  PRRSGRAVRQPDRYMGLAETSVVAPDDDCEDPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELVDQLDRVKPIGCKWIYKRKRCVDG----------

Query:  ----------------------KSIRILLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAIK
                              KSIRILL+I AY+DY++WQMDVKTAFLNG L+ETIYM QP+GFI  GQEQK+C+L+RSIYG K+ASRSWNIRFD AIK
Subjt:  ----------------------KSIRILLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAIK

Query:  SYGFDQNVDEPCVYKKIVDKTIAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSKK
        SYGFDQ VDEPCVYK+I++K++AFLVLYVDDILLIGN++  LTD+K+WLA+QFQMKDLGE  ++LGIQI R+RKN+ LAL QASYIDK++ +Y MQNSK+
Subjt:  SYGFDQNVDEPCVYKKIVDKTIAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSKK

Query:  SLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKILGNL-----------LVVT----
         LLPFRHGV LSK+QCPKTPQDVE+MR IPYASAVGSLMY MLCTR DICYAVGIVSRYQSNPGL HWT VK ILK L              L++T    
Subjt:  SLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKILGNL-----------LVVT----

Query:  -----------------FIRMEESVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHIE
                         F     +VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLR F+ DLEV+PNM+ PITL+CDNSGA  NSREPR+HKRGKHIE
Subjt:  -----------------FIRMEESVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHIE

Query:  RKYHLIREIMHRRDVTVTQIASEHNVADPFTKPLTAKVFKGHLESLGLRELP
        RKYHLIREI+HR DV VTQIAS HNVADPFTKPLTAKVF+GHLESLGLR++P
Subjt:  RKYHLIREIMHRRDVTVTQIASEHNVADPFTKPLTAKVFKGHLESLGLRELP

A0A5A7T2V9 Gag/pol protein3.1e-21570.11Show/hide
Query:  MPRRSGRAVRQPDRYMGLAETSVVAPDDDCEDPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELVDQLDRVKPIGCKWIYKRKRCVDG---------
        MPRRSGR V QP+RY+GL ET VV PDD  EDPL+Y QAM DVDKD+ +KAMD EMESMYFNSVWELVD  + VKPIGCKWIYKRKR   G         
Subjt:  MPRRSGRAVRQPDRYMGLAETSVVAPDDDCEDPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELVDQLDRVKPIGCKWIYKRKRCVDG---------

Query:  -----------------------KSIRILLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAI
                               KSIRILL+I  +YDY++WQMDVKTAFLNG L+E+I+M QP+GFI QGQEQKVC+L+RSIYG K+ASRSWNIRFD AI
Subjt:  -----------------------KSIRILLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAI

Query:  KSYGFDQNVDEPCVYKKIVDKTIAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSK
        KSYGFDQNVDEPCVYKKI    +AFLVLYVDDILLIGN+V +LTDVK WLA+QFQMKDLGE  Y+LGIQI+R+RKN+TLAL QA+YIDK+L RY MQNSK
Subjt:  KSYGFDQNVDEPCVYKKIVDKTIAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSK

Query:  KSLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKILGNL-----------LVVT---
        K LLPFRHGVHLSK+Q PKTPQ+VEDMRRIPYASAVGSLMY MLCTR DICYAVGIVSRYQSNPGLDHWT VK ILK L              L++T   
Subjt:  KSLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKILGNL-----------LVVT---

Query:  ------------------FIRMEESVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHI
                          F     +VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWL+KF+ DLEV+PNMNLPITL+CDNSGA  NS+EPR+HKRGKHI
Subjt:  ------------------FIRMEESVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHI

Query:  ERKYHLIREIMHRRDVTVTQIASEHNVADPFTKPLTAKVFKGHLESLGLREL
        ERKYHLIREI+ R DV VT+IASEHN+ADPFTK LTAKVF+GHLESLGLR++
Subjt:  ERKYHLIREIMHRRDVTVTQIASEHNVADPFTKPLTAKVFKGHLESLGLREL

A0A5A7TZD0 Gag/pol protein6.1e-21670.11Show/hide
Query:  MPRRSGRAVRQPDRYMGLAETSVVAPDDDCEDPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELVDQLDRVKPIGCKWIYKRKRCVDG---------
        MPRRSGR V QP+RY+GL ET VV PDD  EDPL+Y QAM DVDKD+ +KAMD EMESMYFNSVWELVD  + VKPIGCKWIYKRKR   G         
Subjt:  MPRRSGRAVRQPDRYMGLAETSVVAPDDDCEDPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELVDQLDRVKPIGCKWIYKRKRCVDG---------

Query:  -----------------------KSIRILLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAI
                               KSIRILL+I  +YDY++WQMDVKTAFLNG L+E+I+M QP+GFI QGQEQKVC+L+RSIYG K+ASRSWNIRFD AI
Subjt:  -----------------------KSIRILLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAI

Query:  KSYGFDQNVDEPCVYKKIVDKTIAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSK
        KSYGFDQNVDEPCVYKKI    +AFLVLYVDDILLIGN+V +LTDVK WLA+QFQMKDLGE  Y+LGIQI+R+RKN+TLAL QA+YIDK+L RY MQNSK
Subjt:  KSYGFDQNVDEPCVYKKIVDKTIAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSK

Query:  KSLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKILGNL-----------LVVT---
        K LLPFRHGVHLSK+Q PKTPQ+VEDMRRIPYASAVGSLMY MLCTR DICYAVGIVSRYQSNPGLDHWT VK +LK L              L++T   
Subjt:  KSLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKILGNL-----------LVVT---

Query:  ------------------FIRMEESVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHI
                          F     +VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKF+ DLEV+PNMNLPITL+CDNSGA  NS+EPR+HKRGKHI
Subjt:  ------------------FIRMEESVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHI

Query:  ERKYHLIREIMHRRDVTVTQIASEHNVADPFTKPLTAKVFKGHLESLGLREL
        ERKYHLIREI+ R DV VT+IASEHN+ADPFTK LTAKVF+GHLESLGLR++
Subjt:  ERKYHLIREIMHRRDVTVTQIASEHNVADPFTKPLTAKVFKGHLESLGLREL

A0A5A7UYE8 Gag/pol protein6.1e-21670.11Show/hide
Query:  MPRRSGRAVRQPDRYMGLAETSVVAPDDDCEDPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELVDQLDRVKPIGCKWIYKRKRCVDG---------
        MPRRSGR V QP+RY+GL ET VV PDD  EDPL+Y QAM DVDKD+ +KAMD EMESMYFNSVWELVD  + VKPIGCKWIYKRKR   G         
Subjt:  MPRRSGRAVRQPDRYMGLAETSVVAPDDDCEDPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELVDQLDRVKPIGCKWIYKRKRCVDG---------

Query:  -----------------------KSIRILLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAI
                               KSIRILL+I  +YDY++WQMDVKTAFLNG L+E+I+M QP+GFI QGQEQKVC+L+RSIYG K+ASRSWNIRFD AI
Subjt:  -----------------------KSIRILLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAI

Query:  KSYGFDQNVDEPCVYKKIVDKTIAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSK
        KSYGFDQNVDEPCVYKKI    +AFLVLYVDDILLIGN+V +LTDVK WLA+QFQMKDLGE  Y+LGIQI+R+RKN+TLAL QA+YIDK+L RY MQNSK
Subjt:  KSYGFDQNVDEPCVYKKIVDKTIAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSK

Query:  KSLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKILGNL-----------LVVT---
        K LLPFRHGVHLSK+Q PKTPQ+VEDMRRIPYASAVGSLMY MLCTR DICYAVGIVSRYQSNPGLDHWT VK +LK L              L++T   
Subjt:  KSLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKILGNL-----------LVVT---

Query:  ------------------FIRMEESVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHI
                          F     +VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKF+ DLEV+PNMNLPITL+CDNSGA  NS+EPR+HKRGKHI
Subjt:  ------------------FIRMEESVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHI

Query:  ERKYHLIREIMHRRDVTVTQIASEHNVADPFTKPLTAKVFKGHLESLGLREL
        ERKYHLIREI+ R DV VT+IASEHN+ADPFTK LTAKVF+GHLESLGLR++
Subjt:  ERKYHLIREIMHRRDVTVTQIASEHNVADPFTKPLTAKVFKGHLESLGLREL

A0A5D3CPJ6 Gag/pol protein3.5e-21168.48Show/hide
Query:  PRRSGRAVRQPDRYMGLAETSVVAPDDDCEDPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELVDQLDRVKPIGCKWIYKRKRCVDG----------
        PRRSGR    P RYM L ET  V  D D EDPLT+ +AM DVDKDE IKAM+ E+ESMYFNSVW+LVDQ D VKPIGCKWIYKRKR  DG          
Subjt:  PRRSGRAVRQPDRYMGLAETSVVAPDDDCEDPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELVDQLDRVKPIGCKWIYKRKRCVDG----------

Query:  ----------------------KSIRILLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAIK
                              KSIRILL+I AY+DY++WQMDVKTAFLNG L+ETIYM QP+GFI  GQEQK+C+L+RSIYG K+ASRSWNIRFD AIK
Subjt:  ----------------------KSIRILLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAIK

Query:  SYGFDQNVDEPCVYKKIVDKTIAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSKK
        SYGFDQ VDEPCVYK+I++K++AFLVLYVDDILLIGN++  LTD+K+WLA+QFQMKDLGE  ++LGIQI R+RKN+ LAL QASYIDK++ +Y MQNSK+
Subjt:  SYGFDQNVDEPCVYKKIVDKTIAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSKK

Query:  SLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKILGNL-----------LVVT----
         LLPFRHGV LSK+QCPKTPQDVE+MR IPYASAVGSLMY MLCTR DICYAVGIVSRYQSNPGL HWT VK ILK L              L++T    
Subjt:  SLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKILGNL-----------LVVT----

Query:  -----------------FIRMEESVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHIE
                         F     +VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLR F+ DLEV+PNM+ PITL+CDNSGA  NSREPR+HKRGKHIE
Subjt:  -----------------FIRMEESVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHIE

Query:  RKYHLIREIMHRRDVTVTQIASEHNVADPFTKPLTAKVFKGHLESLGLRELP
        RKYHLIREI+HR DV VTQIAS HNVADPFTKPLTAKVF+GHLESLGLR++P
Subjt:  RKYHLIREIMHRRDVTVTQIASEHNVADPFTKPLTAKVFKGHLESLGLRELP

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.3e-5829.17Show/hide
Query:  PLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELVDQLDRVKPIGCKWI-------------YKRKRCVDG-------------------KSIRILLAI
        P ++D+     DK    +A++ E+ +   N+ W +  + +    +  +W+             YK +    G                    S R +L++
Subjt:  PLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELVDQLDRVKPIGCKWI-------------YKRKRCVDG-------------------KSIRILLAI

Query:  VAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVDK----TIAFLVL
        V  Y+  V QMDVKTAFLNG L E IYM  P+G         VC+L+++IYG K+A+R W   F++A+K   F  +  + C+Y  I+DK       +++L
Subjt:  VAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVDK----TIAFLVL

Query:  YVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSKKSLLPFRHGVH---LSKDQCPKTPQDVE
        YVDD+++   ++  + + K++L  +F+M DL E+ + +GI+I    +   + L Q++Y+ K+LS++ M+N      P    ++   L+ D+   T     
Subjt:  YVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSKKSLLPFRHGVH---LSKDQCPKTPQDVE

Query:  DMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKILGNLLVVTFI-----RMEESVV-------------------------
             P  S +G LMY+MLCTR D+  AV I+SRY S    + W  +K +L+ L   + +  I       E  ++                         
Subjt:  DMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKILGNLLVVTFI-----RMEESVV-------------------------

Query:  ------WRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHIERKYHLIREIMHRRDVTVTQI
              W + +Q  +A S+ EAEY+A  EA +EA+WL+  +T + +   +  PI ++ DN G    +  P  HKR KHI+ KYH  RE +    + +  I
Subjt:  ------WRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHIERKYHLIREIMHRRDVTVTQI

Query:  ASEHNVADPFTKPLTAKVFKGHLESLGL
         +E+ +AD FTKPL A  F    + LGL
Subjt:  ASEHNVADPFTKPLTAKVFKGHLESLGL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-949.5e-8935.77Show/hide
Query:  RRSGRAVRQPDRYMGLAETSVVAPDDDCEDPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELVDQLDRVKPIGCKWIYKRKRCVDGK----------
        RRS R   +  RY     T  V   DD  +P +  + +   +K++ +KAM +EMES+  N  ++LV+     +P+ CKW++K K+  D K          
Subjt:  RRSGRAVRQPDRYMGLAETSVVAPDDDCEDPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELVDQLDRVKPIGCKWIYKRKRCVDGK----------

Query:  ----------------------SIRILLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAIKS
                              SIR +L++ A  D +V Q+DVKTAFL+G L+E IYM+QP+GF   G++  VC+L++S+YG K+A R W ++FD  +KS
Subjt:  ----------------------SIRILLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAIKS

Query:  YGFDQNVDEPCVY-KKIVDKTIAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSKK
          + +   +PCVY K+  +     L+LYVDD+L++G +   +  +K  L+  F MKDLG    ILG++IVR R +R L L Q  YI+++L R+ M+N+K 
Subjt:  YGFDQNVDEPCVY-KKIVDKTIAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSKK

Query:  SLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKIL----GNLLV-------------
           P    + LSK  CP T ++  +M ++PY+SAVGSLMY M+CTR DI +AVG+VSR+  NPG +HW  VK IL+ L    G+ L              
Subjt:  SLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKIL----GNLLV-------------

Query:  ---------------VTFIRMEESVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHIE
                         F     ++ W+S  Q C+A ST EAEY+AA E  KE +WL++F+ +L +         ++CD+  A   S+    H R KHI+
Subjt:  ---------------VTFIRMEESVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHIE

Query:  RKYHLIREIMHRRDVTVTQIASEHNVADPFTKPLTAKVFKGHLESLGL
         +YH IRE++    + V +I++  N AD  TK +    F+   E +G+
Subjt:  RKYHLIREIMHRRDVTVTQIASEHNVADPFTKPLTAKVFKGHLESLGL

P25600 Putative transposon Ty5-1 protein YCL074W9.7e-2531.65Show/hide
Query:  MDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVDKTIAFLVLYVDDILLIGNEVEF
        MDV TAFLN  +DE IY+ QP GF+ +     V  L+  +YG K+A   WN   +  +K  GF ++  E  +Y +       ++ +YVDD+L+     + 
Subjt:  MDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVDKTIAFLVLYVDDILLIGNEVEF

Query:  LTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSKKSLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYV
           VK+ L   + MKDLG+V   LG+ I     N  + L    YI K  S  ++   K +  P  +    SK     T   ++D+   PY S VG L++ 
Subjt:  LTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSKKSLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYV

Query:  MLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKIL
            R DI Y V ++SR+   P   H    + +L+ L
Subjt:  MLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKIL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.1e-5228.82Show/hide
Query:  DPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELV-DQLDRVKPIGCKWIYKRKRCVDGK--------------------------------SIRILL
        +P T  QA+ D   +    AM  E+ +   N  W+LV      V  +GC+WI+ +K   DG                                 SIRI+L
Subjt:  DPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELV-DQLDRVKPIGCKWIYKRKRCVDGK--------------------------------SIRILL

Query:  AIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVDKTIAFLVLYV
         +     + + Q+DV  AFL G L + +YM QP GFI + +   VC+L +++YG K+A R+W +     + + GF  +V +  ++     K+I ++++YV
Subjt:  AIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVDKTIAFLVLYV

Query:  DDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSKKSLLPFRHGVHLSKDQCPKTPQDVEDMRRI
        DDIL+ GN+   L +    L+ +F +KD  E+HY LGI+    R    L L Q  YI  +L+R  M  +K    P      LS     K     E     
Subjt:  DDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSKKSLLPFRHGVHLSKDQCPKTPQDVEDMRRI

Query:  PYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKIL------------GNLLVV-------------TFIRMEESVV--------WR
         Y   VGSL Y+   TR DI YAV  +S++   P  +H   +K IL+ L            GN L +              ++     +V        W 
Subjt:  PYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKIL------------GNLLVV-------------TFIRMEESVV--------WR

Query:  SIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHIERKYHLIREIMHRRDVTVTQIASEHNVAD
        S KQ  +  S+ EAEY +    + E  W+   +T+L +   +  P  ++CDN GA      P  H R KHI   YH IR  +    + V  +++   +AD
Subjt:  SIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHIERKYHLIREIMHRRDVTVTQIASEHNVAD

Query:  PFTKPLTAKVFKGHLESLGLRELP
          TKPL+   F+     +G+  +P
Subjt:  PFTKPLTAKVFKGHLESLGLRELP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.9e-5528.47Show/hide
Query:  MPRRSGRAVRQPDRYMGLAETSVVAPDDDCEDPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELV-DQLDRVKPIGCKWIYKRKRCVDGK-------
        M  R+   +R+P++    A TS+ A      +P T  QAM D   D   +AM  E+ +   N  W+LV      V  +GC+WI+ +K   DG        
Subjt:  MPRRSGRAVRQPDRYMGLAETSVVAPDDDCEDPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELV-DQLDRVKPIGCKWIYKRKRCVDGK-------

Query:  -------------------------SIRILLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEA
                                 SIRI+L +     + + Q+DV  AFL G L + +YM QP GF+ + +   VCRL ++IYG K+A R+W +     
Subjt:  -------------------------SIRILLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEA

Query:  IKSYGFDQNVDEPCVYKKIVDKTIAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNS
        + + GF  ++ +  ++     ++I ++++YVDDIL+ GN+   L      L+ +F +K+  ++HY LGI+    R  + L L Q  Y   +L+R  M  +
Subjt:  IKSYGFDQNVDEPCVYKKIVDKTIAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNS

Query:  KKSLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKIL------------GNLLVV--
        K    P      L+     K P   E      Y   VGSL Y+   TR D+ YAV  +S+Y   P  DHW  +K +L+ L            GN L +  
Subjt:  KKSLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILKIL------------GNLLVV--

Query:  -----------TFIRMEESVV--------WRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGK
                    ++     +V        W S KQ  +  S+ EAEY +    + E  W+   +T+L +   ++ P  ++CDN GA      P  H R K
Subjt:  -----------TFIRMEESVV--------WRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGK

Query:  HIERKYHLIREIMHRRDVTVTQIASEHNVADPFTKPLTAKVFKGHLESLGLRELP
        HI   YH IR  +    + V  +++   +AD  TKPL+   F+     +G+ ++P
Subjt:  HIERKYHLIREIMHRRDVTVTQIASEHNVADPFTKPLTAKVFKGHLESLGLRELP

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 87.5e-4930.25Show/hide
Query:  EDPLTYDQAMVDVDKDECI--KAMDQEMESMYFNSVWELVDQLDRVKPIGCKWIYKRKRCVDG--------------------------------KSIRI
        ++P TY++A     K+  +   AMD E+ +M     WE+       KPIGCKW+YK K   DG                                 S+++
Subjt:  EDPLTYDQAMVDVDKDECI--KAMDQEMESMYFNSVWELVDQLDRVKPIGCKWIYKRKRCVDG--------------------------------KSIRI

Query:  LLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIA-QGQE---QKVCRLHRSIYGQKRASRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVDKTIA
        +LAI A Y++ + Q+D+  AFLNG LDE IYM  P G+ A QG       VC L +SIYG K+ASR W ++F   +  +GF Q+  +   + KI      
Subjt:  LLAIVAYYDYDVWQMDVKTAFLNGKLDETIYMDQPKGFIA-QGQE---QKVCRLHRSIYGQKRASRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVDKTIA

Query:  FLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSKKSLLPFRHGVHLSKDQCPKTPQDV
         +++YVDDI++  N    + ++K  L S F+++DLG + Y LG++I R+     + + Q  Y   +L    +   K S +P    V  S      +  D 
Subjt:  FLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSKKSLLPFRHGVHLSKDQCPKTPQDV

Query:  EDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILK-ILGNLLVVTFIRMEES------------------------------
         D +   Y   +G LMY+ + TR DI +AV  +S++   P L H   V  IL  I G +    F   +                                
Subjt:  EDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILK-ILGNLLVVTFIRMEES------------------------------

Query:  --VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHIERKYHLIRE
          + W+S KQ  ++ S+ EAEY A   A  E +WL +F  +L++   ++ P  LFCDN+ A   +     H+R KHIE   H +RE
Subjt:  --VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHIERKYHLIRE

ATMG00810.1 DNA/RNA polymerases superfamily protein9.3e-1538.71Show/hide
Query:  FLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSK--KSLLPFRHGVHLSKDQCPKTPQ
        +L+LYVDDILL G+    L  +   L+S F MKDLG VHY LGIQI  +     L L Q  Y +++L+   M + K   + LP +    +S  + P    
Subjt:  FLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSK--KSLLPFRHGVHLSKDQCPKTPQ

Query:  DVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILK
        D  D R     S VG+L Y+ L TR DI YAV IV +    P L  + ++K +L+
Subjt:  DVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQSNPGLDHWTVVKAILK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCGACGTAGTGGGAGGGCTGTGAGACAGCCTGATCGCTACATGGGTTTAGCTGAAACCTCAGTTGTCGCTCCTGATGATGACTGTGAGGATCCATTGACCTATGA
TCAGGCAATGGTTGATGTTGACAAAGACGAGTGTATTAAAGCTATGGACCAGGAAATGGAGTCTATGTACTTCAATTCTGTATGGGAGCTTGTGGATCAACTGGATAGGG
TAAAACCTATTGGTTGCAAATGGATCTACAAGCGTAAACGTTGCGTAGATGGGAAGTCGATCAGGATCCTTCTGGCCATTGTCGCGTATTATGACTACGACGTATGGCAG
ATGGACGTCAAGACAGCCTTTCTGAATGGCAAACTTGATGAGACCATCTACATGGACCAACCCAAAGGGTTCATCGCCCAAGGCCAAGAGCAAAAAGTTTGTCGGCTTCA
TAGGTCCATTTATGGTCAAAAGCGAGCTTCGAGGTCTTGGAATATAAGGTTTGATGAGGCGATCAAATCTTATGGCTTTGATCAAAATGTCGACGAGCCTTGTGTCTACA
AGAAAATCGTTGACAAAACTATCGCATTTTTGGTGTTGTACGTGGACGATATTCTTCTCATTGGGAATGAGGTAGAATTTCTTACTGACGTTAAAAAGTGGCTAGCTTCG
CAATTCCAAATGAAAGATTTGGGAGAAGTTCACTATATTCTAGGTATCCAGATAGTCCGGAACCGGAAGAACAGAACGCTAGCCTTGTATCAGGCGTCTTATATTGACAA
GATGTTGTCTAGATATAAGATGCAGAACTCCAAGAAGAGCTTGTTGCCTTTCAGGCATGGGGTTCACTTGTCTAAGGATCAGTGTCCTAAGACTCCTCAAGATGTTGAGG
ATATGAGACGAATTCCATATGCTTCAGCTGTAGGGAGCTTGATGTATGTCATGTTGTGTACTAGGTCCGACATCTGTTATGCAGTAGGAATTGTCAGTAGATATCAGTCC
AATCCAGGATTAGATCACTGGACAGTCGTAAAGGCAATCCTCAAGATTCTAGGAAATCTACTTGTGGTCACCTTCATTCGAATGGAGGAGTCAGTAGTATGGCGAAGCAT
CAAGCAGGGATGCATCGCTGATTCCACTATGGAAGCGGAGTATGTTGCGGCTTGTGAAGCTGCAAAGGAAGCTGTTTGGCTTAGGAAGTTCATGACAGATTTGGAAGTTA
TTCCAAATATGAACTTGCCGATCACACTGTTCTGTGATAACAGTGGTGCAGGAGTCAACTCACGTGAGCCTCGGAACCATAAGAGAGGCAAACACATTGAGCGCAAGTAT
CATTTGATACGGGAGATTATGCATCGCAGAGACGTGACGGTCACGCAGATAGCGTCAGAGCACAACGTAGCTGATCCATTTACAAAGCCCCTCACGGCTAAGGTGTTTAA
GGGTCACCTAGAGAGTCTAGGTCTTCGAGAGCTTCCTGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCTCGACGTAGTGGGAGGGCTGTGAGACAGCCTGATCGCTACATGGGTTTAGCTGAAACCTCAGTTGTCGCTCCTGATGATGACTGTGAGGATCCATTGACCTATGA
TCAGGCAATGGTTGATGTTGACAAAGACGAGTGTATTAAAGCTATGGACCAGGAAATGGAGTCTATGTACTTCAATTCTGTATGGGAGCTTGTGGATCAACTGGATAGGG
TAAAACCTATTGGTTGCAAATGGATCTACAAGCGTAAACGTTGCGTAGATGGGAAGTCGATCAGGATCCTTCTGGCCATTGTCGCGTATTATGACTACGACGTATGGCAG
ATGGACGTCAAGACAGCCTTTCTGAATGGCAAACTTGATGAGACCATCTACATGGACCAACCCAAAGGGTTCATCGCCCAAGGCCAAGAGCAAAAAGTTTGTCGGCTTCA
TAGGTCCATTTATGGTCAAAAGCGAGCTTCGAGGTCTTGGAATATAAGGTTTGATGAGGCGATCAAATCTTATGGCTTTGATCAAAATGTCGACGAGCCTTGTGTCTACA
AGAAAATCGTTGACAAAACTATCGCATTTTTGGTGTTGTACGTGGACGATATTCTTCTCATTGGGAATGAGGTAGAATTTCTTACTGACGTTAAAAAGTGGCTAGCTTCG
CAATTCCAAATGAAAGATTTGGGAGAAGTTCACTATATTCTAGGTATCCAGATAGTCCGGAACCGGAAGAACAGAACGCTAGCCTTGTATCAGGCGTCTTATATTGACAA
GATGTTGTCTAGATATAAGATGCAGAACTCCAAGAAGAGCTTGTTGCCTTTCAGGCATGGGGTTCACTTGTCTAAGGATCAGTGTCCTAAGACTCCTCAAGATGTTGAGG
ATATGAGACGAATTCCATATGCTTCAGCTGTAGGGAGCTTGATGTATGTCATGTTGTGTACTAGGTCCGACATCTGTTATGCAGTAGGAATTGTCAGTAGATATCAGTCC
AATCCAGGATTAGATCACTGGACAGTCGTAAAGGCAATCCTCAAGATTCTAGGAAATCTACTTGTGGTCACCTTCATTCGAATGGAGGAGTCAGTAGTATGGCGAAGCAT
CAAGCAGGGATGCATCGCTGATTCCACTATGGAAGCGGAGTATGTTGCGGCTTGTGAAGCTGCAAAGGAAGCTGTTTGGCTTAGGAAGTTCATGACAGATTTGGAAGTTA
TTCCAAATATGAACTTGCCGATCACACTGTTCTGTGATAACAGTGGTGCAGGAGTCAACTCACGTGAGCCTCGGAACCATAAGAGAGGCAAACACATTGAGCGCAAGTAT
CATTTGATACGGGAGATTATGCATCGCAGAGACGTGACGGTCACGCAGATAGCGTCAGAGCACAACGTAGCTGATCCATTTACAAAGCCCCTCACGGCTAAGGTGTTTAA
GGGTCACCTAGAGAGTCTAGGTCTTCGAGAGCTTCCTGACTAG
Protein sequenceShow/hide protein sequence
MPRRSGRAVRQPDRYMGLAETSVVAPDDDCEDPLTYDQAMVDVDKDECIKAMDQEMESMYFNSVWELVDQLDRVKPIGCKWIYKRKRCVDGKSIRILLAIVAYYDYDVWQ
MDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGQKRASRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVDKTIAFLVLYVDDILLIGNEVEFLTDVKKWLAS
QFQMKDLGEVHYILGIQIVRNRKNRTLALYQASYIDKMLSRYKMQNSKKSLLPFRHGVHLSKDQCPKTPQDVEDMRRIPYASAVGSLMYVMLCTRSDICYAVGIVSRYQS
NPGLDHWTVVKAILKILGNLLVVTFIRMEESVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFMTDLEVIPNMNLPITLFCDNSGAGVNSREPRNHKRGKHIERKY
HLIREIMHRRDVTVTQIASEHNVADPFTKPLTAKVFKGHLESLGLRELPD