; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016701 (gene) of Snake gourd v1 genome

Gene IDTan0016701
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG11:24178283..24180696
RNA-Seq ExpressionTan0016701
SyntenyTan0016701
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0005488 - binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]7.5e-22068.83Show/hide
Query:  GYPKETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSTVVLSE-----------------LDGTITKWEGY---------------NKPDRYMSLSE
        GYPKETRGGLF+DP+E++VFVSTNATFLEEDH+R+HKPRS +VLSE                 +D T T  + +               ++P+RY+ L+E
Subjt:  GYPKETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSTVVLSE-----------------LDGTITKWEGY---------------NKPDRYMSLSE

Query:  TKVVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKR----------------------GVDG-------
        T+VVIPDD  EDP +Y QAM D DKD+WV AMD EMES+YFNSVW+LVD P+GVKPIGCKWIYKRKR                      GVD        
Subjt:  TKVVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKR----------------------GVDG-------

Query:  ---KFIRILLAIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYKKIIN
           K IRILL+IA +YDYE+W+MDVKTAFLN NL+E+I+M QP+GFI  GQEQKVC+L RSIYGLKQASRSWNIRFD  IKS+GF+QN DE CVYKKI  
Subjt:  ---KFIRILLAIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYKKIIN

Query:  SSVAFLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQCPKT
          VAFLVLYVDDILLI NDVGYLT +K WLA QFQMKDLGEA++VLGIQI+R+ KNKTL LSQ +YIDK+L+R+ MQ+SKKGLLPFR+GVHLSKEQ PKT
Subjt:  SSVAFLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQCPKT

Query:  PQGVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
        PQ VEDM++IPYASAVGSLMYAMLCTRP+IC       +YQSNP  + WTAVKI+LKYLRRTR+YMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
Subjt:  PQGVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT

Query:  LNGGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIVRNSREP
        LNGG VVWRS+KQGCIADSTME EYVAACE AK+ VWLRKF+ DLEVVPNM+LPITLYCDNSG V NS+EP
Subjt:  LNGGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIVRNSREP

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]2.6e-1088.89Show/hide
Query:  MTKRPFSGKGYKAKEPLELIHSDLCGPMNIKARGGY
        MTKRPF+GKGY+AKEPLELIHSDLCGPMN+KARGG+
Subjt:  MTKRPFSGKGYKAKEPLELIHSDLCGPMNIKARGGY

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]7.5e-22068.83Show/hide
Query:  GYPKETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSTVVLSE-----------------LDGTITKWEGY---------------NKPDRYMSLSE
        GYPKETRGGLF+DP+E++VFVSTNATFLEEDH+R+HKPRS +VLSE                 +D T T  + +               ++P+RY+ L+E
Subjt:  GYPKETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSTVVLSE-----------------LDGTITKWEGY---------------NKPDRYMSLSE

Query:  TKVVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKR----------------------GVDG-------
        T+VVIPDD  EDP +Y QAM D DKD+WV AMD EMES+YFNSVW+LVD P+GVKPIGCKWIYKRKR                      GVD        
Subjt:  TKVVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKR----------------------GVDG-------

Query:  ---KFIRILLAIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYKKIIN
           K IRILL+IA +YDYE+W+MDVKTAFLN NL+E+I+M QP+GFI  GQEQKVC+L RSIYGLKQASRSWNIRFD  IKS+GF+QN DE CVYKKI  
Subjt:  ---KFIRILLAIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYKKIIN

Query:  SSVAFLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQCPKT
          VAFLVLYVDDILLI NDVGYLT +K WLA QFQMKDLGEA++VLGIQI+R+ KNKTL LSQ +YIDK+L+R+ MQ+SKKGLLPFR+GVHLSKEQ PKT
Subjt:  SSVAFLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQCPKT

Query:  PQGVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
        PQ VEDM++IPYASAVGSLMYAMLCTRP+IC       +YQSNP  + WTAVKI+LKYLRRTR+YMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
Subjt:  PQGVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT

Query:  LNGGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIVRNSREP
        LNGG VVWRS+KQGCIADSTME EYVAACE AK+ VWLRKF+ DLEVVPNM+LPITLYCDNSG V NS+EP
Subjt:  LNGGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIVRNSREP

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-0986.11Show/hide
Query:  MTKRPFSGKGYKAKEPLELIHSDLCGPMNIKARGGY
        MTKRPF+GKGY+AKEPLELIHSDLCGPMN+KARG +
Subjt:  MTKRPFSGKGYKAKEPLELIHSDLCGPMNIKARGGY

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]3.5e-21767.48Show/hide
Query:  GYPKETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSTVVLSELDGTITK------------------------------------WEGYNKPDRYM
        GYPK TRGG FYDPK++KVFVSTNATFLEEDHIR+HKPRS +VL+EL    T+                                        N P RYM
Subjt:  GYPKETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSTVVLSELDGTITK------------------------------------WEGYNKPDRYM

Query:  SLSETKVVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKRGVDGKF-----------------------
        SL+ET  VI D D EDP T+ +AM D DKD+W+ AM+ E+ES+YFNSVWDLVD+PDGVKPIGCKWIYKRKRG DGK                        
Subjt:  SLSETKVVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKRGVDGKF-----------------------

Query:  ---------IRILLAIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYK
                 IRILL+IAAY+DYE+W+MDVKTAFLN NL+E IYM QP+GFI PGQEQK+C+L RSIYGLKQASRSWNIRFD  IKS+GF+Q  DE CVYK
Subjt:  ---------IRILLAIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYK

Query:  KIINSSVAFLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQ
        +IIN SVAFLVLYVDDILLI ND+G LT IK WLATQFQMKDLGEA+FVLGIQI R+ KNK L LSQ SYIDK+++++ MQ+SK+GLLPFR+GV LSKEQ
Subjt:  KIINSSVAFLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQ

Query:  CPKTPQGVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSG
        CPKTPQ VE+M+ IPYASAVGSLMYAMLCTRP+IC       +YQSNP    WTAVK ILKYLRRTR+Y LVYG+KDLILTGYTDSDFQTD+DSRKSTSG
Subjt:  CPKTPQGVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSG

Query:  SVFTLNGGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIVRNSREP
        SVFTLNGG VVWRS+KQGCIADSTME EYVAACE AK+ VWLR F++DLEVVPNMS PITLYCDNSG V NSREP
Subjt:  SVFTLNGGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIVRNSREP

KAA0050437.1 gag/pol protein [Cucumis melo var. makuwa]4.2e-22360.99Show/hide
Query:  MTKRPFSGKGYKAKEPLELIHSDLCGPMNIKARGGYP---------------------------------------------------------------
        MTKRPF+GKGY+AKEPLELIHS LCGPMN+KARGG+                                                                
Subjt:  MTKRPFSGKGYKAKEPLELIHSDLCGPMNIKARGGYP---------------------------------------------------------------

Query:  -KETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSTVVLSE-----------------LDGTITKWEGY---------------NKPDRYMSLSETK
         ++TRGGLF+DP+E++VFVSTNATFLEEDH+R+HKPR+ +VLSE                 +D T T  + +               ++P+RY+ L+ET+
Subjt:  -KETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSTVVLSE-----------------LDGTITKWEGY---------------NKPDRYMSLSETK

Query:  VVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKRGVDGKF-----------------------------
        VVIPDD  EDP +Y QAM D DKD+WV AMD EMES+YFN VW+LVD P+GVKPIGCKWIYKRKR   GK                              
Subjt:  VVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKRGVDGKF-----------------------------

Query:  ---IRILLAIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYKKIINSS
           IRILL+IA +YDYE+WKMDVKTA LN NL+E+I+M Q +GFI  GQEQKVC+L RSIYGLKQASRSWNIRFD  IK +GF+QN DE CVYKKI    
Subjt:  ---IRILLAIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYKKIINSS

Query:  VAFLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQCPKTPQ
        VAFLVLYVDDILLI ND+GYLT +K WLA QFQMKDLGEA++VLGIQI+R+ KNK L LSQ +YIDK+L+R+ MQ+SKKGLLPFR+GVHLSKEQC KTPQ
Subjt:  VAFLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQCPKTPQ

Query:  GVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLN
         VEDM++IPYASAVGSLMY MLCTRP IC       +YQSNP  + WTAVKIILKYLRRTR+YMLVYGAKDLILTGYTD DFQTDKDSRKSTSGSVFTLN
Subjt:  GVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLN

Query:  GGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIVRNSREP
        GG VVWRS+KQGCIAD TME EYVAACE AK+ VWLRKF+ DLEVVPNM+LPITLYCDNSG V N +EP
Subjt:  GGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIVRNSREP

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]2.6e-1088.89Show/hide
Query:  MTKRPFSGKGYKAKEPLELIHSDLCGPMNIKARGGY
        MTKRPF+GKGY+AKEPLELIHSDLCGPMN+KARGG+
Subjt:  MTKRPFSGKGYKAKEPLELIHSDLCGPMNIKARGGY

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]5.4e-21868.48Show/hide
Query:  GYPKETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSTVVLSE-----------------LDGTITKWEGY---------------NKPDRYMSLSE
        GYPKETRGGLF+DPKE++VFVSTNATFLEEDH+R+HKPRS +VLSE                 +D T T  + +               ++P+RY+ L+E
Subjt:  GYPKETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSTVVLSE-----------------LDGTITKWEGY---------------NKPDRYMSLSE

Query:  TKVVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKR----------------------GVDG-------
        T+VVIPDD  EDP +Y QAM D DKD+WV AMD EMES+YFNSVW+LVD P+GVKPIGCKWIYKRKR                      GVD        
Subjt:  TKVVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKR----------------------GVDG-------

Query:  ---KFIRILLAIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYKKIIN
           K IRILL+IA +YDYE+W+MDVKTAFLN NL+E+I+M QP+GFI  GQEQKVC+L RSIYGLKQASRSWNIRFD  IKS+GF+QN DE CVYKKI  
Subjt:  ---KFIRILLAIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYKKIIN

Query:  SSVAFLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQCPKT
          VAFLVLYVDDILLI NDVGYLT +K WLA QFQMKDLGE ++VLGIQI+R+ KNKTL LSQ +YIDK+L+R+ MQ+SKKGLLPFR+GVHLSKEQ PKT
Subjt:  SSVAFLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQCPKT

Query:  PQGVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
        PQ VEDM++IPYASAVGSLMYAMLCTRP+IC       +YQSNP  + WTAVKIILKYLRRTR+YMLVYGAKDLILTGYT+SDFQTDKDSRKSTS SVFT
Subjt:  PQGVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT

Query:  LNGGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIVRNSREP
        LNGG VVWRS+KQGCIADSTME EYVAACE AK+ VWL+KF+ DLEVVPNM+LPITLYCDNSG V NS+EP
Subjt:  LNGGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIVRNSREP

TYJ97618.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-0880.56Show/hide
Query:  MTKRPFSGKGYKAKEPLELIHSDLCGPMNIKARGGY
        MTKRPF+GKG++AKEPLEL+HSDLCGPMN+KARG +
Subjt:  MTKRPFSGKGYKAKEPLELIHSDLCGPMNIKARGGY

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein6.1e-1083.33Show/hide
Query:  MTKRPFSGKGYKAKEPLELIHSDLCGPMNIKARGGY
        MTKRPF+GKG++AKEPLEL+HSDLCGPMN+KARGG+
Subjt:  MTKRPFSGKGYKAKEPLELIHSDLCGPMNIKARGGY

A0A5A7T2V9 Gag/pol protein6.1e-1086.11Show/hide
Query:  MTKRPFSGKGYKAKEPLELIHSDLCGPMNIKARGGY
        MTKRPF+GKGY+AKEPLELIHSDLCGPMN+KARG +
Subjt:  MTKRPFSGKGYKAKEPLELIHSDLCGPMNIKARGGY

A0A5A7T2V9 Gag/pol protein1.7e-21767.48Show/hide
Query:  GYPKETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSTVVLSELDGTITK------------------------------------WEGYNKPDRYM
        GYPK TRGG FYDPK++KVFVSTNATFLEEDHIR+HKPRS +VL+EL    T+                                        N P RYM
Subjt:  GYPKETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSTVVLSELDGTITK------------------------------------WEGYNKPDRYM

Query:  SLSETKVVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKRGVDGKF-----------------------
        SL+ET  VI D D EDP T+ +AM D DKD+W+ AM+ E+ES+YFNSVWDLVD+PDGVKPIGCKWIYKRKRG DGK                        
Subjt:  SLSETKVVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKRGVDGKF-----------------------

Query:  ---------IRILLAIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYK
                 IRILL+IAAY+DYE+W+MDVKTAFLN NL+E IYM QP+GFI PGQEQK+C+L RSIYGLKQASRSWNIRFD  IKS+GF+Q  DE CVYK
Subjt:  ---------IRILLAIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYK

Query:  KIINSSVAFLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQ
        +IIN SVAFLVLYVDDILLI ND+G LT IK WLATQFQMKDLGEA+FVLGIQI R+ KNK L LSQ SYIDK+++++ MQ+SK+GLLPFR+GV LSKEQ
Subjt:  KIINSSVAFLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQ

Query:  CPKTPQGVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSG
        CPKTPQ VE+M+ IPYASAVGSLMYAMLCTRP+IC       +YQSNP    WTAVK ILKYLRRTR+Y LVYG+KDLILTGYTDSDFQTD+DSRKSTSG
Subjt:  CPKTPQGVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSG

Query:  SVFTLNGGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIVRNSREP
        SVFTLNGG VVWRS+KQGCIADSTME EYVAACE AK+ VWLR F++DLEVVPNMS PITLYCDNSG V NSREP
Subjt:  SVFTLNGGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIVRNSREP

A0A5A7TZD0 Gag/pol protein3.6e-22068.83Show/hide
Query:  GYPKETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSTVVLSE-----------------LDGTITKWEGY---------------NKPDRYMSLSE
        GYPKETRGGLF+DP+E++VFVSTNATFLEEDH+R+HKPRS +VLSE                 +D T T  + +               ++P+RY+ L+E
Subjt:  GYPKETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSTVVLSE-----------------LDGTITKWEGY---------------NKPDRYMSLSE

Query:  TKVVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKR----------------------GVDG-------
        T+VVIPDD  EDP +Y QAM D DKD+WV AMD EMES+YFNSVW+LVD P+GVKPIGCKWIYKRKR                      GVD        
Subjt:  TKVVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKR----------------------GVDG-------

Query:  ---KFIRILLAIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYKKIIN
           K IRILL+IA +YDYE+W+MDVKTAFLN NL+E+I+M QP+GFI  GQEQKVC+L RSIYGLKQASRSWNIRFD  IKS+GF+QN DE CVYKKI  
Subjt:  ---KFIRILLAIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYKKIIN

Query:  SSVAFLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQCPKT
          VAFLVLYVDDILLI NDVGYLT +K WLA QFQMKDLGEA++VLGIQI+R+ KNKTL LSQ +YIDK+L+R+ MQ+SKKGLLPFR+GVHLSKEQ PKT
Subjt:  SSVAFLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQCPKT

Query:  PQGVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
        PQ VEDM++IPYASAVGSLMYAMLCTRP+IC       +YQSNP  + WTAVKI+LKYLRRTR+YMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
Subjt:  PQGVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT

Query:  LNGGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIVRNSREP
        LNGG VVWRS+KQGCIADSTME EYVAACE AK+ VWLRKF+ DLEVVPNM+LPITLYCDNSG V NS+EP
Subjt:  LNGGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIVRNSREP

A0A5A7TZD0 Gag/pol protein1.2e-1088.89Show/hide
Query:  MTKRPFSGKGYKAKEPLELIHSDLCGPMNIKARGGY
        MTKRPF+GKGY+AKEPLELIHSDLCGPMN+KARGG+
Subjt:  MTKRPFSGKGYKAKEPLELIHSDLCGPMNIKARGGY

A0A5A7TZD0 Gag/pol protein3.6e-22068.83Show/hide
Query:  GYPKETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSTVVLSE-----------------LDGTITKWEGY---------------NKPDRYMSLSE
        GYPKETRGGLF+DP+E++VFVSTNATFLEEDH+R+HKPRS +VLSE                 +D T T  + +               ++P+RY+ L+E
Subjt:  GYPKETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSTVVLSE-----------------LDGTITKWEGY---------------NKPDRYMSLSE

Query:  TKVVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKR----------------------GVDG-------
        T+VVIPDD  EDP +Y QAM D DKD+WV AMD EMES+YFNSVW+LVD P+GVKPIGCKWIYKRKR                      GVD        
Subjt:  TKVVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKR----------------------GVDG-------

Query:  ---KFIRILLAIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYKKIIN
           K IRILL+IA +YDYE+W+MDVKTAFLN NL+E+I+M QP+GFI  GQEQKVC+L RSIYGLKQASRSWNIRFD  IKS+GF+QN DE CVYKKI  
Subjt:  ---KFIRILLAIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYKKIIN

Query:  SSVAFLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQCPKT
          VAFLVLYVDDILLI NDVGYLT +K WLA QFQMKDLGEA++VLGIQI+R+ KNKTL LSQ +YIDK+L+R+ MQ+SKKGLLPFR+GVHLSKEQ PKT
Subjt:  SSVAFLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQCPKT

Query:  PQGVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
        PQ VEDM++IPYASAVGSLMYAMLCTRP+IC       +YQSNP  + WTAVKI+LKYLRRTR+YMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
Subjt:  PQGVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT

Query:  LNGGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIVRNSREP
        LNGG VVWRS+KQGCIADSTME EYVAACE AK+ VWLRKF+ DLEVVPNM+LPITLYCDNSG V NS+EP
Subjt:  LNGGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIVRNSREP

A0A5A7U7T0 Gag/pol protein2.1e-22360.99Show/hide
Query:  MTKRPFSGKGYKAKEPLELIHSDLCGPMNIKARGGYP---------------------------------------------------------------
        MTKRPF+GKGY+AKEPLELIHS LCGPMN+KARGG+                                                                
Subjt:  MTKRPFSGKGYKAKEPLELIHSDLCGPMNIKARGGYP---------------------------------------------------------------

Query:  -KETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSTVVLSE-----------------LDGTITKWEGY---------------NKPDRYMSLSETK
         ++TRGGLF+DP+E++VFVSTNATFLEEDH+R+HKPR+ +VLSE                 +D T T  + +               ++P+RY+ L+ET+
Subjt:  -KETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSTVVLSE-----------------LDGTITKWEGY---------------NKPDRYMSLSETK

Query:  VVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKRGVDGKF-----------------------------
        VVIPDD  EDP +Y QAM D DKD+WV AMD EMES+YFN VW+LVD P+GVKPIGCKWIYKRKR   GK                              
Subjt:  VVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKRGVDGKF-----------------------------

Query:  ---IRILLAIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYKKIINSS
           IRILL+IA +YDYE+WKMDVKTA LN NL+E+I+M Q +GFI  GQEQKVC+L RSIYGLKQASRSWNIRFD  IK +GF+QN DE CVYKKI    
Subjt:  ---IRILLAIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYKKIINSS

Query:  VAFLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQCPKTPQ
        VAFLVLYVDDILLI ND+GYLT +K WLA QFQMKDLGEA++VLGIQI+R+ KNK L LSQ +YIDK+L+R+ MQ+SKKGLLPFR+GVHLSKEQC KTPQ
Subjt:  VAFLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQCPKTPQ

Query:  GVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLN
         VEDM++IPYASAVGSLMY MLCTRP IC       +YQSNP  + WTAVKIILKYLRRTR+YMLVYGAKDLILTGYTD DFQTDKDSRKSTSGSVFTLN
Subjt:  GVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLN

Query:  GGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIVRNSREP
        GG VVWRS+KQGCIAD TME EYVAACE AK+ VWLRKF+ DLEVVPNM+LPITLYCDNSG V N +EP
Subjt:  GGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIVRNSREP

A0A5A7UYE8 Gag/pol protein1.2e-1088.89Show/hide
Query:  MTKRPFSGKGYKAKEPLELIHSDLCGPMNIKARGGY
        MTKRPF+GKGY+AKEPLELIHSDLCGPMN+KARGG+
Subjt:  MTKRPFSGKGYKAKEPLELIHSDLCGPMNIKARGGY

A0A5A7UYE8 Gag/pol protein2.6e-21868.48Show/hide
Query:  GYPKETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSTVVLSE-----------------LDGTITKWEGY---------------NKPDRYMSLSE
        GYPKETRGGLF+DPKE++VFVSTNATFLEEDH+R+HKPRS +VLSE                 +D T T  + +               ++P+RY+ L+E
Subjt:  GYPKETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSTVVLSE-----------------LDGTITKWEGY---------------NKPDRYMSLSE

Query:  TKVVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKR----------------------GVDG-------
        T+VVIPDD  EDP +Y QAM D DKD+WV AMD EMES+YFNSVW+LVD P+GVKPIGCKWIYKRKR                      GVD        
Subjt:  TKVVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKR----------------------GVDG-------

Query:  ---KFIRILLAIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYKKIIN
           K IRILL+IA +YDYE+W+MDVKTAFLN NL+E+I+M QP+GFI  GQEQKVC+L RSIYGLKQASRSWNIRFD  IKS+GF+QN DE CVYKKI  
Subjt:  ---KFIRILLAIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYKKIIN

Query:  SSVAFLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQCPKT
          VAFLVLYVDDILLI NDVGYLT +K WLA QFQMKDLGE ++VLGIQI+R+ KNKTL LSQ +YIDK+L+R+ MQ+SKKGLLPFR+GVHLSKEQ PKT
Subjt:  SSVAFLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQCPKT

Query:  PQGVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT
        PQ VEDM++IPYASAVGSLMYAMLCTRP+IC       +YQSNP  + WTAVKIILKYLRRTR+YMLVYGAKDLILTGYT+SDFQTDKDSRKSTS SVFT
Subjt:  PQGVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFT

Query:  LNGGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIVRNSREP
        LNGG VVWRS+KQGCIADSTME EYVAACE AK+ VWL+KF+ DLEVVPNM+LPITLYCDNSG V NS+EP
Subjt:  LNGGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIVRNSREP

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.1e-5528.72Show/hide
Query:  PSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRK----------------RGVDGKF----------------IRILLAI
        P+++++     DK  W  A++ E+ +   N+ W +  +P+    +  +W++  K                RG   K+                 R +L++
Subjt:  PSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRK----------------RGVDGKF----------------IRILLAI

Query:  AAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVY---KKIINSSVAFLVLY
           Y+ ++ +MDVKTAFLN  L E IYM  P+G         VC+L ++IYGLKQA+R W   F++ +K   F  +  +RC+Y   K  IN ++ +++LY
Subjt:  AAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVY---KKIINSSVAFLVLY

Query:  VDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLP----FRYGVHLSKEQCPKTPQGVE
        VDD+++   D+  +   K +L  +F+M DL E +  +GI+I    +   + LSQ++Y+ K+L +F M++      P      Y +  S E C        
Subjt:  VDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLP----FRYGVHLSKEQCPKTPQGVE

Query:  DMKQIPYASAVGSLMYAMLCTRPNI-CVFSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVYG---AKDLILTGYTDSDFQTDKDSRKSTSGSVFTLN
             P  S +G LMY MLCTRP++    +   +Y S    E W  +K +L+YL+ T +  L++    A +  + GY DSD+   +  RKST+G +F + 
Subjt:  DMKQIPYASAVGSLMYAMLCTRPNI-CVFSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVYG---AKDLILTGYTDSDFQTDKDSRKSTSGSVFTLN

Query:  GGDVV-WRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIVRNSREP
          +++ W + +Q  +A S+ E EY+A  E  ++ +WL+  +  + +   +  PI +Y DN G +  +  P
Subjt:  GGDVV-WRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIVRNSREP

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.2e-9239.15Show/hide
Query:  TKVVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKRGVDGKF---------------------------
        T+ V+  DD  +P +  + +   +K++ + AM +EMES+  N  + LV+ P G +P+ CKW++K K+  D K                            
Subjt:  TKVVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKRGVDGKF---------------------------

Query:  -----IRILLAIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVY-KKII
             IR +L++AA  D E+ ++DVKTAFL+ +L+E IYM+QP+GF   G++  VC+L +S+YGLKQA R W ++FD  +KS  + +   + CVY K+  
Subjt:  -----IRILLAIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVY-KKII

Query:  NSSVAFLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQCPK
         ++   L+LYVDD+L++  D G +  +K  L+  F MKDLG A+ +LG++IVR   ++ L LSQ  YI++VL RF M+++K    P    + LSK+ CP 
Subjt:  NSSVAFLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQCPK

Query:  TPQGVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVF
        T +   +M ++PY+SAVGSLMYAM+CTRP+I        ++  NP  E W AVK IL+YLR T    L +G  D IL GYTD+D   D D+RKS++G +F
Subjt:  TPQGVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVF

Query:  TLNGGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIVRNSR
        T +GG + W+S  Q C+A ST E EY+AA E  K+ +WL++F+ +L +         +YCD+   +  S+
Subjt:  TLNGGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIVRNSR

P25600 Putative transposon Ty5-1 protein YCL074W1.9e-3232.9Show/hide
Query:  MDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYKKIINSSVAFLVLYVDDILLIKNDVGY
        MDV TAFLN  +DE IY+ QP GF+       V  L   +YGLKQA   WN   + T+K  GF +++ E  +Y +  +    ++ +YVDD+L+       
Subjt:  MDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYKKIINSSVAFLVLYVDDILLIKNDVGY

Query:  LTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQCPKTPQGVEDMKQIPYASAVGSLMYA
           +K  L   + MKDLG+    LG+ I     N  +TLS   YI K     ++   K    P    +  SK     T   ++D+   PY S VG L++ 
Subjt:  LTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQCPKTPQGVEDMKQIPYASAVGSLMYA

Query:  MLCTRPNICV-FSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVY-GAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGDVVWRSVK-QGCIADST
            RP+I    S   ++   PR     + + +L+YL  TR+  L Y     L LT Y D+      D   ST G V  L G  V W S K +G I   +
Subjt:  MLCTRPNICV-FSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVY-GAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGDVVWRSVK-QGCIADST

Query:  MEVEYVAACE
         E EY+ A E
Subjt:  MEVEYVAACE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.5e-4930.18Show/hide
Query:  DPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDG-VKPIGCKWIYKRKRGVDGKF--------------------------------IRILL
        +P T  QA+ D   ++W  AM  E+ +   N  WDLV  P   V  +GC+WI+ +K   DG                                  IRI+L
Subjt:  DPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDG-VKPIGCKWIYKRKRGVDGKF--------------------------------IRILL

Query:  AIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYKKIINSSVAFLVLYV
         +A    + + ++DV  AFL   L +++YM QP GFI+  +   VC+L++++YGLKQA R+W +     + + GF  +  +  ++      S+ ++++YV
Subjt:  AIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYKKIINSSVAFLVLYV

Query:  DDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQCPKTPQGVEDMKQI
        DDIL+  ND   L    + L+ +F +KD  E  + LGI+  R      L LSQ  YI  +L R  M  +K    P      LS     K     E     
Subjt:  DDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQCPKTPQGVEDMKQI

Query:  PYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGDVVWR
         Y   VGSL Y +  TRP+I    +   Q+   P  E   A+K IL+YL  T N+ + +     L L  Y+D+D+  DKD   ST+G +  L    + W 
Subjt:  PYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGDVVWR

Query:  SVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSG
        S KQ  +  S+ E EY +    + +  W+   + +L +   ++ P  +YCDN G
Subjt:  SVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.9e-5029.02Show/hide
Query:  EGYNKPDRYMSLSETKVVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLV-DKPDGVKPIGCKWIYKRKRGVDGKF------------
        +G  KP++  S + +          +P T  QAM D   D+W  AM  E+ +   N  WDLV   P  V  +GC+WI+ +K   DG              
Subjt:  EGYNKPDRYMSLSETKVVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLV-DKPDGVKPIGCKWIYKRKRGVDGKF------------

Query:  --------------------IRILLAIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGF
                            IRI+L +A    + + ++DV  AFL   L + +YM QP GF++  +   VCRL+++IYGLKQA R+W +     + + GF
Subjt:  --------------------IRILLAIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGF

Query:  NQNDDERCVYKKIINSSVAFLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLP
          +  +  ++      S+ ++++YVDDIL+  ND   L    + L+ +F +K+  +  + LGI+  R    + L LSQ  Y   +L R  M  +K    P
Subjt:  NQNDDERCVYKKIINSSVAFLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLP

Query:  FRYGVHLSKEQCPKTPQGVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNY-MLVYGAKDLILTGYTDSDF
              L+     K P   E      Y   VGSL Y +  TRP++    +   QY   P  + W A+K +L+YL  T ++ + +     L L  Y+D+D+
Subjt:  FRYGVHLSKEQCPKTPQGVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNY-MLVYGAKDLILTGYTDSDF

Query:  QTDKDSRKSTSGSVFTLNGGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSG
          D D   ST+G +  L    + W S KQ  +  S+ E EY +    + +  W+   + +L +   +S P  +YCDN G
Subjt:  QTDKDSRKSTSGSVFTLNGGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSG

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 88.7e-5731.02Show/hide
Query:  EDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKRGVDGKF--------------------------------IRILL
        ++PSTYN+A   K+   W  AMD E+ ++     W++   P   KPIGCKW+YK K   DG                                  ++++L
Subjt:  EDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKRGVDGKF--------------------------------IRILL

Query:  AIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFI----EPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYKKIINSSVAFL
        AI+A Y++ L ++D+  AFLN +LDE IYM  P G+     +      VC LK+SIYGLKQASR W ++F  T+  FGF Q+  +   + KI  +    +
Subjt:  AIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFI----EPGQEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYKKIINSSVAFL

Query:  VLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQCPKTPQGVED
        ++YVDDI++  N+   +  +K+ L + F+++DLG  ++ LG++I R+     + + Q  Y   +L    +   K   +P    V  S         G + 
Subjt:  VLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQCPKTPQGVED

Query:  MKQIPYASAVGSLMYAMLCTRPNICVFSWN--GQYQSNPRPERWTAVKIILKYLRRTRNYMLVYGAK-DLILTGYTDSDFQTDKDSRKSTSGSVFTLNGG
        +    Y   +G LMY  + TR +I  F+ N   Q+   PR     AV  IL Y++ T    L Y ++ ++ L  ++D+ FQ+ KD+R+ST+G    L   
Subjt:  MKQIPYASAVGSLMYAMLCTRPNICVFSWN--GQYQSNPRPERWTAVKIILKYLRRTRNYMLVYGAK-DLILTGYTDSDFQTDKDSRKSTSGSVFTLNGG

Query:  DVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIV
         + W+S KQ  ++ S+ E EY A      + +WL +F  +L++   +S P  L+CDN+  +
Subjt:  DVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIV

ATMG00810.1 DNA/RNA polymerases superfamily protein2.2e-1529.66Show/hide
Query:  FLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSK--KGLLPFRYGVHLSKEQCPKTPQ
        +L+LYVDDILL  +    L  +   L++ F MKDLG   + LGIQI  +     L LSQT Y +++L    M D K     LP +    +S  + P    
Subjt:  FLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLTLSQTSYIDKVLLRFKMQDSK--KGLLPFRYGVHLSKEQCPKTPQ

Query:  GVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTL
                 + S VG+L Y  L TRP+I    +   Q    P    +  +K +L+Y++ T  + + ++    L +  + DSD+     +R+ST+G    L
Subjt:  GVEDMKQIPYASAVGSLMYAMLCTRPNIC-VFSWNGQYQSNPRPERWTAVKIILKYLRRTRNY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTL

Query:  NGGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVW
            + W + +Q  ++ S+ E EY A    A +  W
Subjt:  NGGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)5.9e-0529.47Show/hide
Query:  GYNKPDRYMSLSETKVVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKRGVDGKFIRILLAIAA
        G NK +   SL+ T  +      ++P +   A+ D     W  AM +E++++  N  W LV  P     +GCKW++K K   DG   R+   + A
Subjt:  GYNKPDRYMSLSETKVVIPDDDCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKRGVDGKFIRILLAIAA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTAAACGACCTTTTAGTGGAAAAGGTTACAAAGCCAAAGAACCTCTTGAGCTTATACATTCAGACCTTTGTGGTCCGATGAATATTAAAGCTAGAGGAGGCTACCC
AAAAGAAACAAGAGGTGGATTATTCTATGATCCTAAGGAAGATAAGGTATTTGTGTCGACAAATGCCACTTTCTTAGAGGAGGACCACATAAGGGACCACAAACCAAGAA
GTACAGTTGTGTTAAGTGAGTTAGACGGAACAATAACAAAGTGGGAGGGTTATAACAAACCTGACCGTTACATGAGTTTATCTGAAACCAAAGTTGTTATACCAGATGAC
GACTGTGAGGATCCATCGACTTATAATCAAGCGATGGTTGACAAAGACAAAGACAAATGGGTCATAGCCATGGACCAAGAAATGGAGTCTATTTACTTCAATTCTGTTTG
GGATCTTGTAGATAAACCTGATGGGGTAAAACCTATAGGTTGTAAGTGGATCTACAAGAGAAAGCGTGGTGTAGATGGAAAGTTTATCCGTATCCTACTTGCCATTGCCG
CATATTATGACTATGAGCTATGGAAAATGGATGTCAAGACAGCATTCTTGAATGACAATCTTGACGAGAACATCTACATGGACCAACCCAAAGGGTTCATTGAACCAGGA
CAAGAACAAAAGGTTTGTAGGCTTAAAAGGTCAATCTATGGACTGAAACAAGCCTCTAGGTCCTGGAATATAAGATTTGATGAGACAATCAAATCTTTTGGCTTTAATCA
GAATGATGATGAACGTTGTGTCTACAAGAAAATCATCAATAGTTCTGTCGCATTCCTAGTTCTATATGTGGATGATATCCTACTCATTAAGAATGATGTAGGTTACCTTA
CTGGCATTAAGAATTGGCTAGCTACACAATTCCAAATGAAAGATTTGGGTGAAGCGCGATTTGTTCTTGGGATCCAGATTGTCCGAAACTGCAAGAATAAAACATTAACC
CTGTCTCAGACATCTTACATCGACAAAGTGTTGTTGAGGTTTAAGATGCAAGACTCCAAAAAGGGTTTATTGCCTTTTAGATATGGAGTTCATTTGTCTAAGGAACAGTG
TCCTAAGACACCTCAAGGAGTTGAGGATATGAAACAGATTCCTTATGCATCAGCTGTTGGGAGCCTGATGTACGCCATGTTGTGTACTAGGCCAAACATCTGTGTTTTTA
GTTGGAATGGTCAGTATCAATCCAATCCAAGACCTGAACGCTGGACAGCGGTTAAAATAATCCTTAAGTATCTACGGAGAACAAGAAACTACATGCTTGTGTATGGGGCT
AAGGATTTGATCCTTACAGGATACACAGATTCTGACTTTCAAACTGATAAAGATTCTCGGAAATCCACATCAGGGTCAGTATTTACTCTTAACGGAGGAGATGTAGTATG
GCGAAGCGTCAAGCAAGGATGCATCGCTGATTCCACCATGGAGGTCGAATATGTAGCAGCTTGTGAAGTAGCTAAAAAGCCCGTTTGGTTAAGGAAATTCATGTTAGATT
TGGAAGTTGTTCCAAATATGAGTTTGCCCATCACACTGTATTGTGATAATAGTGGTATCGTGAGAAATTCAAGGGAACCCGAAGTCACAAGAGGAGAAAGCATATTGAGC
GAAAATATCATCTCATCCAGGAGATCGTGCATCGAGGAGACGTGA
mRNA sequenceShow/hide mRNA sequence
ATGACTAAACGACCTTTTAGTGGAAAAGGTTACAAAGCCAAAGAACCTCTTGAGCTTATACATTCAGACCTTTGTGGTCCGATGAATATTAAAGCTAGAGGAGGCTACCC
AAAAGAAACAAGAGGTGGATTATTCTATGATCCTAAGGAAGATAAGGTATTTGTGTCGACAAATGCCACTTTCTTAGAGGAGGACCACATAAGGGACCACAAACCAAGAA
GTACAGTTGTGTTAAGTGAGTTAGACGGAACAATAACAAAGTGGGAGGGTTATAACAAACCTGACCGTTACATGAGTTTATCTGAAACCAAAGTTGTTATACCAGATGAC
GACTGTGAGGATCCATCGACTTATAATCAAGCGATGGTTGACAAAGACAAAGACAAATGGGTCATAGCCATGGACCAAGAAATGGAGTCTATTTACTTCAATTCTGTTTG
GGATCTTGTAGATAAACCTGATGGGGTAAAACCTATAGGTTGTAAGTGGATCTACAAGAGAAAGCGTGGTGTAGATGGAAAGTTTATCCGTATCCTACTTGCCATTGCCG
CATATTATGACTATGAGCTATGGAAAATGGATGTCAAGACAGCATTCTTGAATGACAATCTTGACGAGAACATCTACATGGACCAACCCAAAGGGTTCATTGAACCAGGA
CAAGAACAAAAGGTTTGTAGGCTTAAAAGGTCAATCTATGGACTGAAACAAGCCTCTAGGTCCTGGAATATAAGATTTGATGAGACAATCAAATCTTTTGGCTTTAATCA
GAATGATGATGAACGTTGTGTCTACAAGAAAATCATCAATAGTTCTGTCGCATTCCTAGTTCTATATGTGGATGATATCCTACTCATTAAGAATGATGTAGGTTACCTTA
CTGGCATTAAGAATTGGCTAGCTACACAATTCCAAATGAAAGATTTGGGTGAAGCGCGATTTGTTCTTGGGATCCAGATTGTCCGAAACTGCAAGAATAAAACATTAACC
CTGTCTCAGACATCTTACATCGACAAAGTGTTGTTGAGGTTTAAGATGCAAGACTCCAAAAAGGGTTTATTGCCTTTTAGATATGGAGTTCATTTGTCTAAGGAACAGTG
TCCTAAGACACCTCAAGGAGTTGAGGATATGAAACAGATTCCTTATGCATCAGCTGTTGGGAGCCTGATGTACGCCATGTTGTGTACTAGGCCAAACATCTGTGTTTTTA
GTTGGAATGGTCAGTATCAATCCAATCCAAGACCTGAACGCTGGACAGCGGTTAAAATAATCCTTAAGTATCTACGGAGAACAAGAAACTACATGCTTGTGTATGGGGCT
AAGGATTTGATCCTTACAGGATACACAGATTCTGACTTTCAAACTGATAAAGATTCTCGGAAATCCACATCAGGGTCAGTATTTACTCTTAACGGAGGAGATGTAGTATG
GCGAAGCGTCAAGCAAGGATGCATCGCTGATTCCACCATGGAGGTCGAATATGTAGCAGCTTGTGAAGTAGCTAAAAAGCCCGTTTGGTTAAGGAAATTCATGTTAGATT
TGGAAGTTGTTCCAAATATGAGTTTGCCCATCACACTGTATTGTGATAATAGTGGTATCGTGAGAAATTCAAGGGAACCCGAAGTCACAAGAGGAGAAAGCATATTGAGC
GAAAATATCATCTCATCCAGGAGATCGTGCATCGAGGAGACGTGA
Protein sequenceShow/hide protein sequence
MTKRPFSGKGYKAKEPLELIHSDLCGPMNIKARGGYPKETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSTVVLSELDGTITKWEGYNKPDRYMSLSETKVVIPDD
DCEDPSTYNQAMVDKDKDKWVIAMDQEMESIYFNSVWDLVDKPDGVKPIGCKWIYKRKRGVDGKFIRILLAIAAYYDYELWKMDVKTAFLNDNLDENIYMDQPKGFIEPG
QEQKVCRLKRSIYGLKQASRSWNIRFDETIKSFGFNQNDDERCVYKKIINSSVAFLVLYVDDILLIKNDVGYLTGIKNWLATQFQMKDLGEARFVLGIQIVRNCKNKTLT
LSQTSYIDKVLLRFKMQDSKKGLLPFRYGVHLSKEQCPKTPQGVEDMKQIPYASAVGSLMYAMLCTRPNICVFSWNGQYQSNPRPERWTAVKIILKYLRRTRNYMLVYGA
KDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGDVVWRSVKQGCIADSTMEVEYVAACEVAKKPVWLRKFMLDLEVVPNMSLPITLYCDNSGIVRNSREPEVTRGESILS
ENIISSRRSCIEET